ETHIAD: A novel explainable model for detecting illicit accounts on Ethereum
Jiarong Lu,
Bin Liao,
Yi Liu and
Kutorzi Edwin Yao
PLOS ONE, 2025, vol. 20, issue 12, 1-23
Abstract:
Ethereum has become a significant trading platform for financial activities such as Dapps, ICOs, and DeFi. However, it has also become a hub for criminal activities such as fraud, money laundering, and illicit fundraising. The construction of fraud detection models employing machine learning techniques is currently a mainstream research direction. Nevertheless, existing studies face significant challenges, including class imbalance in data samples and a lack of model interpretability. In this content, this work proposes a novel explainable model for Ethereum illicit account detection, ETHIAD (Ethereum Illicit Account Detection). Firstly, we pre-process the dataset by ADASYN oversampling and Lasso feature selection, etc., to more efficiently achieve feature modeling of transaction structures. Then, the ETHIAD model is trained using the XGboost algorithm, with an accuracy, precision, recall, F1 score, and AUC value of 99.70%, 99.51%, 99.02%, 99.26%, and 99.45%, respectively, the model outperforms the existing SOTA model by 0.05%−1.1%. Finally, we introduce SHAP framework to analyze the key influencing factors of illicit accounts from multiple perspectives, and the conclusions strongly enhance the explainability of the model.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0338366 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 38366&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0338366
DOI: 10.1371/journal.pone.0338366
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().