Bi-LSTM-XGBoost ensemble-based intrusion detection system: Addressing data imbalance and enhancing minority class performance
Woo-Seong Kim and
Hyun-Jung Kim
Edelweiss Applied Science and Technology, 2025, vol. 9, issue 3, 2993-2999
Abstract:
Intrusion Detection Systems (IDS) are critical in identifying abnormal network activities and mitigating potential security threats. However, existing IDS solutions struggle to detect rare attack types, such as Remote-to-Local (R2L) and User-to-Root (U2R), primarily due to data imbalance. To address this challenge, we propose an ensemble model combining Bidirectional Long Short-Term Memory (Bi-LSTM) networks and eXtreme Gradient Boosting (XGBoost). On the NSL-KDD dataset, the model achieves an accuracy of 98.42% and reduces the false positive rates for the R2L and U2R classes by approximately 90% and 67%, respectively (p-value < 0.05). Moreover, the proposed model achieves an Area Under the Receiver Operating Characteristic Curve (AUC-ROC) score of 0.89 for R2L detection, outperforming the Bi-LSTM-Random Forest baseline (0.88). For U2R detection, the AUC improved from 0.58 to 0.66. These findings highlight the model's enhanced capability for minority class detection and its potential to mitigate data imbalance issues in IDS. Future work will focus on integrating Conditional Generative Adversarial Networks (Conditional GANs) for data augmentation, optimizing hyperparameters using Particle Swarm Optimization (PSO), and validating the model's generalizability on the CICIDS2017 and UNSW-NB15 datasets.
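The per-class figures quoted in the abstract (false positive rate, AUC-ROC, and relative reduction) can be illustrated with a minimal sketch. This is not the authors' code: the function names, labels, and scores below are hypothetical toy values chosen for illustration, and the abstract does not say how the Bi-LSTM and XGBoost outputs are combined, so the sketch covers only how such one-vs-rest metrics are commonly computed.

```python
from typing import Sequence


def false_positive_rate(y_true: Sequence[str], y_pred: Sequence[str], cls: str) -> float:
    """One-vs-rest false positive rate for `cls`: FP / (FP + TN)."""
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != cls and p == cls)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t != cls and p != cls)
    return fp / (fp + tn) if (fp + tn) else 0.0


def one_vs_rest_auc(y_true: Sequence[str], scores: Sequence[float], cls: str) -> float:
    """AUC-ROC for `cls` via the rank (Mann-Whitney) formulation.

    This is the fraction of (positive, negative) pairs in which the positive
    sample gets the higher score, with ties counted as one half.
    """
    pos = [s for t, s in zip(y_true, scores) if t == cls]
    neg = [s for t, s in zip(y_true, scores) if t != cls]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def relative_reduction(before: float, after: float) -> float:
    """Relative reduction of a rate, e.g. 0.10 -> 0.01 is a 0.9 (90%) reduction."""
    return (before - after) / before


# Hypothetical toy data (NOT taken from the paper or from NSL-KDD).
y_true = ["normal", "normal", "normal", "normal", "R2L", "R2L", "U2R", "normal"]
y_pred = ["normal", "R2L", "normal", "normal", "R2L", "normal", "U2R", "normal"]
# Hypothetical model scores for the R2L class, one per sample.
r2l_scores = [0.10, 0.70, 0.20, 0.05, 0.90, 0.40, 0.30, 0.15]

r2l_fpr = false_positive_rate(y_true, y_pred, "R2L")
r2l_auc = one_vs_rest_auc(y_true, r2l_scores, "R2L")
fpr_drop = relative_reduction(0.10, 0.01)

print(f"R2L FPR: {r2l_fpr:.3f}, R2L AUC: {r2l_auc:.3f}, relative reduction: {fpr_drop:.0%}")
```

On this toy data, one of the six non-R2L samples is flagged as R2L, giving a false positive rate of 1/6, and 11 of the 12 positive-negative pairs are ranked correctly, giving an AUC of 11/12. With heavily imbalanced classes such as R2L and U2R, per-class rates of this kind are more informative than overall accuracy, which is dominated by the majority classes.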
Keywords: Bi-LSTM; XGBoost; Ensemble Model; Intrusion Detection System; Minority Class Detection
Date: 2025
Downloads: (external link)
https://learning-gate.com/index.php/2576-8484/article/view/5899/2125 (application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:ajp:edwast:v:9:y:2025:i:3:p:2993-2999:id:5899
More articles in Edelweiss Applied Science and Technology from Learning Gate
Bibliographic data for series maintained by Melissa Fernandes.