Optimizing Intrusion Detection Systems: A Machine Learning-Based Feature Selection Approach for Enhanced Cybersecurity
Essarghi Hiba Allah () and
Darouichi Aziz ()
Additional contact information
Essarghi Hiba Allah: Cadi Ayyad University, Faculty of Science and Technology, L2IS
Darouichi Aziz: Cadi Ayyad University, Faculty of Science and Technology, L2IS
A chapter in Reliability in Cyber-Physical Systems: The Human Factor Perspective, 2026, pp 147-161 from Springer
Abstract:
Abstract The rapid evolution of technologies such as artificial intelligence and big data has led to an increase in sophisticated cyberattacks, making network security an urgent concern. Intrusion Detection Systems (IDS) have been widely studied, with machine learning playing an important role in enhancing their effectiveness. However, developing a high-performing IDS remains challenging due to the complexity of high-dimensional data and limitations in detection accuracy. Effective feature selection is essential for improving IDS performance by reducing dimensionality, enhancing interpretability, and preventing overfitting. In this study, we propose a comprehensive data preprocessing pipeline to optimize feature selection and apply it to the UNSW-NB15 dataset, a benchmark for cybersecurity threat detection. We evaluate the impact of several feature selection techniques on four machine learning models: Logistic Regression, K-Nearest Neighbors (KNN), Decision Tree, and Random Forest. The selection methods include XGBoost-based selection with correlation analysis, Chi-square test, Recursive Feature Elimination (RFE), Fisher’s test and CatBoost-based selection. We analyze model robustness across various feature subset sizes, and evaluate performance using Accuracy, Precision, Recall, and F1-score. Our findings demonstrate that feature selection significantly impacts model performance, with Random Forest achieving the highest accuracy (94.96%) using only 25 features selected via XGBoost and correlation. These results highlight the importance of optimal feature selection for designing more effective IDS solutions.
Keywords: Intrusion detection systems (ids); Machine learning; Cybersecurity; Feature selection; UNSW-NB15 dataset (search for similar items in EconPapers)
Date: 2026
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:ssrchp:978-3-032-09917-4_10
Ordering information: This item can be ordered from
http://www.springer.com/9783032099174
DOI: 10.1007/978-3-032-09917-4_10
Access Statistics for this chapter
More chapters in Springer Series in Reliability Engineering from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().