CybAttT: A Dataset of Cyberattack News Tweets for Enhanced Threat Intelligence

Huda Lughbi, Mourad Mars and Khaled Almotairi
Additional contact information
Huda Lughbi: College of Computing, Umm-Alqura University, Mecca 24382, Saudi Arabia
Mourad Mars: College of Computing, Umm-Alqura University, Mecca 24382, Saudi Arabia
Khaled Almotairi: College of Computing, Umm-Alqura University, Mecca 24382, Saudi Arabia

Data, 2024, vol. 9, issue 3, 1-16

Abstract: Continuous developments in information technologies have led to a significant rise in security concerns, including cybercrimes, unauthorized access, and cyberattacks. Recently, researchers have increasingly turned to social media platforms such as X to investigate cyberattacks. Collecting and analyzing news about cyberattacks from tweets can efficiently provide crucial insights into the attacks themselves, including their impacts, the regions where they occur, and potential mitigation strategies. However, there is a shortage of labeled datasets related to cyberattacks. This paper describes CybAttT, a dataset of 36,071 English cyberattack-related tweets. These tweets are manually labeled into three classes: high-risk news, normal news, and not news. The final overall inter-annotator agreement was 0.99 (Fleiss' kappa), which indicates high agreement. To assess the dataset's quality and suitability for its intended purpose, we conducted experiments using different supervised machine learning algorithms and various fine-tuned language models. The F1-score of 87.6% achieved on the CybAttT dataset demonstrates the potential of our approach and supports the quality and thoroughness of its annotations. We have made the CybAttT dataset publicly accessible for research purposes.
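
The agreement statistic named in the abstract, Fleiss' kappa, can be illustrated with a minimal sketch. The three class labels come from the abstract; the function name, the toy ratings, and the number of annotators per tweet are assumptions made for illustration only and are not taken from the paper.

```python
from collections import Counter

# Class labels as listed in the abstract.
LABELS = ("high-risk news", "normal news", "not news")


def fleiss_kappa(ratings, categories=LABELS):
    """Compute Fleiss' kappa.

    ratings: list of per-item label lists; every item must be rated
    by the same number of annotators (at least two).
    """
    n_items = len(ratings)
    n_raters = len(ratings[0])
    if n_raters < 2 or any(len(r) != n_raters for r in ratings):
        raise ValueError("each item needs the same number (>= 2) of ratings")

    counts = [Counter(r) for r in ratings]

    # Observed agreement: mean over items of the per-item agreement P_i.
    p_items = [
        (sum(c[cat] ** 2 for cat in categories) - n_raters)
        / (n_raters * (n_raters - 1))
        for c in counts
    ]
    p_bar = sum(p_items) / n_items

    # Chance agreement: sum of squared overall category proportions.
    p_cat = [
        sum(c[cat] for c in counts) / (n_items * n_raters) for cat in categories
    ]
    p_e = sum(p ** 2 for p in p_cat)

    if p_e == 1.0:
        # Every rating falls in one category; kappa is undefined, report 1.0.
        return 1.0
    return (p_bar - p_e) / (1 - p_e)


# Hypothetical toy data: three tweets, three annotators each (assumed count).
toy_ratings = [
    ["high-risk news"] * 3,
    ["normal news"] * 3,
    ["not news"] * 3,
]
kappa = fleiss_kappa(toy_ratings)
```

With full agreement on every item, as in the toy data, the statistic reaches its maximum of 1; values near the reported 0.99 therefore correspond to annotators disagreeing on only a very small share of tweets.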

Keywords: cyberattacks; dataset; labeling; tweets; classification; machine learning; large language models
JEL-codes: C8 C80 C81 C82 C83
Date: 2024

Downloads:
https://www.mdpi.com/2306-5729/9/3/39/pdf (application/pdf)
https://www.mdpi.com/2306-5729/9/3/39/ (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:gam:jdataj:v:9:y:2024:i:3:p:39-:d:1344674

Data is currently edited by Ms. Cecilia Yang

Bibliographic data for series maintained by MDPI Indexing Manager.

 
Page updated 2025-03-19
Handle: RePEc:gam:jdataj:v:9:y:2024:i:3:p:39-:d:1344674