A Self-Supervised Learning Model for Unknown Internet Traffic Identification Based on Surge Period
Dawei Wei,
Feifei Shi and
Sahraoui Dhelim
Additional contact information
Dawei Wei: School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China
Feifei Shi: School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China
Sahraoui Dhelim: School of Computer Science, University College Dublin, Belfield, D04 V1W8 Dublin, Ireland
Future Internet, 2022, vol. 14, issue 10, 1-16
Abstract:
Identifying Internet protocols is an important basis for maintaining Internet security and improving Internet Quality of Service (QoS). However, the rapid development and updating of Internet technologies and protocols have produced large volumes of unknown Internet traffic, which seriously threaten the safety of the network environment. Because most unknown Internet traffic carries no labels, deep learning is difficult to apply directly. In addition, the accuracy of the extracted features and the choice of identification model both strongly affect identification accuracy. In this paper, we propose a surge period-based feature extraction method that helps remove the negative influence of background traffic in network sessions and captures as many traffic flow features as possible. We also establish an identification model for unknown Internet traffic based on JigClu, a self-supervised learning approach for training on unlabeled datasets. The model is then combined with a clustering method to further identify unknown Internet traffic. On the public dataset ISCXVPN2016, the model achieves an accuracy of no less than 74% in identifying unknown Internet traffic under different scenarios. The work provides a novel solution for unknown Internet traffic identification, which is the most difficult task in identifying Internet traffic. We believe it is a significant advance in Internet traffic identification and is of great significance to maintaining the security of the network environment.
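The abstract does not define the surge period precisely, so the following is only a minimal sketch under a stated assumption: the surge period of a session is taken to be the fixed-length time window that contains the most packets, so that sparse background traffic outside it can be discarded before feature extraction. The function name `find_surge_window` and the `window` parameter are hypothetical and do not come from the paper.

```python
from typing import List, Tuple


def find_surge_window(timestamps: List[float], window: float) -> Tuple[float, float, int]:
    """Return (start_time, end_time, packet_count) of the densest window.

    Assumption (not from the paper): the "surge period" is the time span of
    at most `window` seconds that contains the largest number of packets.
    """
    if not timestamps:
        raise ValueError("session contains no packets")
    ts = sorted(timestamps)  # packet arrival times in seconds
    best_left, best_right = 0, 0
    left = 0
    for right in range(len(ts)):
        # Advance the left edge until the span fits inside `window` seconds.
        while ts[right] - ts[left] > window:
            left += 1
        # Keep the first window that holds strictly more packets.
        if right - left > best_right - best_left:
            best_left, best_right = left, right
    return ts[best_left], ts[best_right], best_right - best_left + 1


# Two early packets, a burst of four packets around t = 5 s, one late packet.
session = [0.0, 0.1, 5.0, 5.1, 5.2, 5.3, 9.0]
start, end, count = find_surge_window(session, window=1.0)
print(start, end, count)
```

In this sketch, only the packets between `start` and `end` would be passed on to feature extraction; how the paper actually delimits the surge period and which features it extracts should be taken from the full text.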
Keywords: unknown Internet traffic identification; self-supervised learning; surge period; clustering
JEL-codes: O3
Date: 2022
Downloads:
https://www.mdpi.com/1999-5903/14/10/289/pdf (application/pdf)
https://www.mdpi.com/1999-5903/14/10/289/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jftint:v:14:y:2022:i:10:p:289-:d:937541
Future Internet is currently edited by Ms. Grace You