Programmers' de-anonymization using a hybrid approach of abstract syntax tree and deep learning
Farhan Ullah,
Sohail Jabbar and
Fadi Al-Turjman
Technological Forecasting and Social Change, 2020, vol. 159, issue C
Abstract:
Source Code Authorship Attribution (SCAA) is a direct challenge to the privacy and anonymity of developers. However, it is important to recognize the malicious authors and the origin of the attack. In this paper, we proposed Source Code Authorship Attribution using Abstract Syntax Tree (SCAA-AST) for efficient classification of programmers. First, the AST hierarchal features are generated from different programming codes. Second, preprocessing techniques are used to obtain useful features without sound data. Third, the Term Frequency Inverse Document Frequency (TFIDF) weighting technique is used to zoom in on the significance of each feature. Fourth, the Adaptive Synthetic (ADASYN) oversampling method is used to solve the imbalanced class problem. Finally, a deep learning algorithm is designed with the TensorFlow framework, and the Keras API is used to classify programming authors. A deep learning algorithm is further configured with a dropout layer, learning error rate, loss and activation function, and dense layers to enhance the classification results. The results are appreciable in outperforming the existing techniques from the perspective of classification accuracy.
Keywords: Source code authorship; Software forensics; Abstract syntax tree; Adasyn; Software Security; Software Similarity (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S004016252031012X
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:tefoso:v:159:y:2020:i:c:s004016252031012x
DOI: 10.1016/j.techfore.2020.120186
Access Statistics for this article
Technological Forecasting and Social Change is currently edited by Fred Phillips
More articles in Technological Forecasting and Social Change from Elsevier
Bibliographic data for series maintained by Catherine Liu ().