Research on a financial fraud identification model by fusing a convolutional neural network
Haiyan Lu,
Shuhe Zhu,
Yajing Zhang and
Ruyi Lv
PLOS ONE, 2026, vol. 21, issue 5, 1-29
Abstract:
To address the challenges of low accuracy and poor real-time performance in existing financial fraud identification methods for listed companies, this paper proposes a hybrid identification model (CNN-SVM) that fuses a convolutional neural network with a support vector machine. Utilizing a comprehensive dataset of 7,429 samples from non-financial A-share listed companies penalized for fraud between 2007 and 2022, the study employs random oversampling to rebalance the minority class, resulting in a robust training set of 13,540 samples. The model leverages a CNN architecture to automatically extract high-level features from 87 indicators spanning corporate governance, financial oversight, and operational metrics, which are then classified using a linear-kernel SVM. Experimental results demonstrate that the CNN-SVM model achieves a qualitative leap in performance, yielding an AUC of 0.97, a recall of 0.99, and an F1-score of 0.97, significantly outperforming traditional logistic regression and random forest benchmarks. These findings suggest that the proposed framework effectively balances training efficiency with high precision, providing a superior tool for real-time financial risk control.
Date: 2026
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0348569 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 48569&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0348569
DOI: 10.1371/journal.pone.0348569
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().