Insider threat detection using supervised machine learning algorithms

Manoharan, Phavithra; Yin, Jiao; Wang, Hua; Zhang, Yanchun; Ye, Wenjie

Insider threat detection using supervised machine learning algorithms

Phavithra Manoharan (), Jiao Yin (), Hua Wang (), Yanchun Zhang () and Wenjie Ye ()
Additional contact information
Phavithra Manoharan: Zhejiang Normal University
Jiao Yin: Zhejiang Normal University
Hua Wang: Zhejiang Normal University
Yanchun Zhang: Zhejiang Normal University
Wenjie Ye: Zhejiang Normal University

Telecommunication Systems: Modelling, Analysis, Design and Management, 2024, vol. 87, issue 4, No 2, 899-915

Abstract: Abstract Insider threats refer to abnormal actions taken by individuals with privileged access, compromising system data’s confidentiality, integrity, and availability. They pose significant cybersecurity risks, leading to substantial losses for several organizations. Detecting insider threats is crucial due to the imbalance in their datasets. Moreover, the performance of existing works has been evaluated on various datasets and problem settings, making it challenging to compare the effectiveness of different algorithms and offer recommendations to decision-makers. Furthermore, no existing work investigates the impact of changing hyperparameters. This paper aims to objectively assess the performance of various supervised machine learning algorithms for detecting insider threats under the same setting. We precisely evaluate the performance of various supervised machine learning algorithms on a balanced dataset using the same feature extraction method. Additionally, we explore the impact of hyperparameter tuning on performance within the balanced dataset. Finally, we investigate the performance of different algorithms in the context of imbalanced datasets under various conditions. We conduct all the experiments in the publicly available CERT r4.2 dataset. The results show that supervised learning with a balanced dataset in RF obtains the best accuracy and F1-score of 95.9% compared with existing works, such as, DNN, LSTM Autoencoder and User Behavior Analysis.

Keywords: Cybersecurity; Insider threat; Imbalanced dataset; Supervised learning (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s11235-023-01085-3 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:telsys:v:87:y:2024:i:4:d:10.1007_s11235-023-01085-3

Ordering information: This journal article can be ordered from
http://www.springer.com/journal/11235

DOI: 10.1007/s11235-023-01085-3

Access Statistics for this article

Telecommunication Systems: Modelling, Analysis, Design and Management is currently edited by Muhammad Khan

More articles in Telecommunication Systems: Modelling, Analysis, Design and Management from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().