EconPapers    
Economics at your fingertips  
 

Predicting financial fraud in Chinese listed companies: An enterprise portrait and machine learning approach

Zejun Zhang, Zhao Wang and Lixin Cai

Pacific-Basin Finance Journal, 2025, vol. 90, issue C

Abstract: Financial fraud of listed companies is a frequent problem in the capital market. Due to factors such as information asymmetry and inadequate regulation, financial fraud severely restricts stakeholders' capital allocation behavior and hinders the sustainable development of the capital market. However, existing research lacks systematic and quantitative insights into the characteristics of firms involved in financial fraud, making it difficult to achieve quantitative identification of most such firms. This limitation arises from a predominant focus on the causal relationships between various financial indicators and financial fraud. In this paper, we integrate machine learning and enterprise portrait methods, using listed companies in the Chinese capital market as research subjects to predict corporate financial fraud. Firstly, a comprehensive system of indicators is established, covering seven dimensions: basic corporate information, profitability, solvency, operating efficiency, capital structure, corporate governance, and emotional attitude. Subsequently, the feature visualization portrait is created using Gaussian mixture model (GMM) clustering and label classification, while the predictive role of multidimensional enterprise portrait features in assessing the risk of corporate financial fraud is examined. The results indicate that unstructured indicators, such as Management Discussion and Analysis (MD&A), can significantly enhance predictive capability for corporate financial fraud. The SHapley Additive exPlanations (SHAP) method is introduced to reveal the influencing factors and characteristics of financial fraud. The empirical findings show that firms involved in financial fraud typically exhibit characteristics such as shorter listing times, weaker solvency and operating efficiency, higher capital structure, and poor corporate governance ability. Moreover, the XGBoost model demonstrates superior predictive performance among various models. The findings of this study provide a new perspective for in-depth exploration of the impact mechanisms of financial fraud and related regulatory warnings. These findings contribute to enhancing the effectiveness of governance and the capital allocation function within the capital market.

Keywords: Financial fraud; Enterprise portrait; Machine learning; Unstructured data; Interpretability analysis (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0927538X25000022
Full text for ScienceDirect subscribers only

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:eee:pacfin:v:90:y:2025:i:c:s0927538x25000022

DOI: 10.1016/j.pacfin.2025.102665

Access Statistics for this article

Pacific-Basin Finance Journal is currently edited by K. Chan and S. Ghon Rhee

More articles in Pacific-Basin Finance Journal from Elsevier
Bibliographic data for series maintained by Catherine Liu ().

 
Page updated 2025-03-24
Handle: RePEc:eee:pacfin:v:90:y:2025:i:c:s0927538x25000022