Insider threat detection using supervised machine learning algorithms
Phavithra Manoharan (),
Jiao Yin (),
Hua Wang (),
Yanchun Zhang () and
Wenjie Ye ()
Additional contact information
Phavithra Manoharan: Zhejiang Normal University
Jiao Yin: Zhejiang Normal University
Hua Wang: Zhejiang Normal University
Yanchun Zhang: Zhejiang Normal University
Wenjie Ye: Zhejiang Normal University
Telecommunication Systems: Modelling, Analysis, Design and Management, 2024, vol. 87, issue 4, No 2, 899-915
Abstract:
Abstract Insider threats refer to abnormal actions taken by individuals with privileged access, compromising system data’s confidentiality, integrity, and availability. They pose significant cybersecurity risks, leading to substantial losses for several organizations. Detecting insider threats is crucial due to the imbalance in their datasets. Moreover, the performance of existing works has been evaluated on various datasets and problem settings, making it challenging to compare the effectiveness of different algorithms and offer recommendations to decision-makers. Furthermore, no existing work investigates the impact of changing hyperparameters. This paper aims to objectively assess the performance of various supervised machine learning algorithms for detecting insider threats under the same setting. We precisely evaluate the performance of various supervised machine learning algorithms on a balanced dataset using the same feature extraction method. Additionally, we explore the impact of hyperparameter tuning on performance within the balanced dataset. Finally, we investigate the performance of different algorithms in the context of imbalanced datasets under various conditions. We conduct all the experiments in the publicly available CERT r4.2 dataset. The results show that supervised learning with a balanced dataset in RF obtains the best accuracy and F1-score of 95.9% compared with existing works, such as, DNN, LSTM Autoencoder and User Behavior Analysis.
Keywords: Cybersecurity; Insider threat; Imbalanced dataset; Supervised learning (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s11235-023-01085-3 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:telsys:v:87:y:2024:i:4:d:10.1007_s11235-023-01085-3
Ordering information: This journal article can be ordered from
http://www.springer.com/journal/11235
DOI: 10.1007/s11235-023-01085-3
Access Statistics for this article
Telecommunication Systems: Modelling, Analysis, Design and Management is currently edited by Muhammad Khan
More articles in Telecommunication Systems: Modelling, Analysis, Design and Management from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().