Resampling Imbalanced Network Intrusion Datasets to Identify Rare Attacks
Sikha Bagui (),
Dustin Mink,
Subhash Bagui,
Sakthivel Subramaniam and
Daniel Wallace
Additional contact information
Sikha Bagui: Department of Computer Science, University of West Florida, Pensacola, FL 32514, USA
Dustin Mink: Department of Computer Science, University of West Florida, Pensacola, FL 32514, USA
Subhash Bagui: Department of Mathematics and Statistics, University of West Florida, Pensacola, FL 32514, USA
Sakthivel Subramaniam: Department of Computer Science, University of West Florida, Pensacola, FL 32514, USA
Daniel Wallace: Department of Computer Science, University of West Florida, Pensacola, FL 32514, USA
Future Internet, 2023, vol. 15, issue 4, 1-24
Abstract:
This study, focusing on identifying rare attacks in imbalanced network intrusion datasets, explored the effect of using different ratios of oversampled to undersampled data for binary classification. Two designs were compared: random undersampling before splitting the training and testing data and random undersampling after splitting the training and testing data. This study also examines how oversampling/undersampling ratios affect random forest classification rates in datasets with minority dataor rare attacks. The results suggest that random undersampling before splitting gives better classification rates; however, random undersampling after oversampling with BSMOTE allows for the use of lower ratios of oversampled data.
Keywords: imbalanced data; resampling; rare attacks; network intrusion datasets; minority data; oversampling; BSMOTE; random undersampling; random forest (search for similar items in EconPapers)
JEL-codes: O3 (search for similar items in EconPapers)
Date: 2023
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/1999-5903/15/4/130/pdf (application/pdf)
https://www.mdpi.com/1999-5903/15/4/130/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jftint:v:15:y:2023:i:4:p:130-:d:1110552
Access Statistics for this article
Future Internet is currently edited by Ms. Grace You
More articles in Future Internet from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().