Enhancing Corporate Transparency: AI-Based Detection of Financial Misstatements in Korean Firms Using NearMiss Sampling and Explainable Models
Woosung Kim and Sooin Kim
Additional contact information
Woosung Kim: Department of Business Administration, Konkuk University, Seoul 05029, Republic of Korea
Sooin Kim: Department of Business Administration, Konkuk University, Seoul 05029, Republic of Korea
Sustainability, 2025, vol. 17, issue 19, 1-27
Abstract:
Corporate transparency is vital for sustainable governance. However, detecting financial misstatements remains challenging due to their rarity and resulting class imbalance. Using financial statement data from Korean firms, this study develops an integrated AI framework that evaluates the joint effects of sampling strategy, model choice, and interpretability. Across multiple imbalance ratios, NearMiss undersampling consistently outperforms random undersampling—particularly in recall and F1-score—showing that careful data balancing can yield greater improvements than algorithmic complexity alone. To ensure interpretability rests on reliable predictions, we apply Shapley Additive Explanations (SHAP) and Permutation Feature Importance (PFI) only to high-performing models. Logistic regression emphasizes globally influential operating and financing accounts, whereas Random Forest identifies context-dependent patterns such as ownership structure and discretionary spending. Even with a reduced feature set identified by explainable AI, models maintain robust detection performance under low imbalance, highlighting the practical value of interpretability in building simpler and more transparent systems. By combining predictive accuracy with transparency, this study contributes to trustworthy misstatement detection tools that reinforce investor confidence, strengthen responsible corporate governance, and reduce information asymmetry. In doing so, it advances the United Nations Sustainable Development Goal 16 (Peace, Justice, and Strong Institutions) by supporting fair, accountable, and sustainable economic systems.
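To illustrate the sampling contrast the abstract describes, here is a minimal sketch of NearMiss undersampling next to random undersampling. It assumes the NearMiss-1 variant (keep the majority-class points whose mean distance to their k nearest minority-class points is smallest); the abstract does not say which NearMiss version or which k the authors used. The function names, the toy two-feature data, and the 2:1 target ratio are all hypothetical and stand in for the Korean financial statement data, which is not reproduced here.

```python
import math
import random


def nearmiss1(majority, minority, n_keep, k=3):
    """Keep the n_keep majority points whose mean distance to their
    k nearest minority points is smallest (NearMiss version 1)."""
    scored = []
    for idx, point in enumerate(majority):
        # Distances from this majority point to every minority point, ascending.
        dists = sorted(math.dist(point, m) for m in minority)
        nearest = dists[:k]
        scored.append((sum(nearest) / len(nearest), idx))
    scored.sort()
    return [majority[idx] for _, idx in scored[:n_keep]]


def random_undersample(majority, n_keep, seed=0):
    """Baseline: keep n_keep majority points chosen uniformly at random."""
    return random.Random(seed).sample(majority, n_keep)


# Hypothetical toy data: 10 "misstatement" firms, 200 "clean" firms.
rng = random.Random(42)
minority = [(rng.gauss(2.0, 0.5), rng.gauss(2.0, 0.5)) for _ in range(10)]
majority = [(rng.gauss(0.0, 1.5), rng.gauss(0.0, 1.5)) for _ in range(200)]

ratio = 2  # assumed majority:minority ratio after balancing
n_keep = ratio * len(minority)

kept_nearmiss = nearmiss1(majority, minority, n_keep)
kept_random = random_undersample(majority, n_keep)
```

Both samplers return the same number of majority-class rows; the difference is that NearMiss-1 concentrates them near the minority class, so a classifier trained on the balanced set sees the hardest negative cases. In practice a maintained implementation such as the `NearMiss` class in the imbalanced-learn package would likely be preferable to this hand-rolled version.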
Keywords: financial misstatement detection; AI-based framework; NearMiss sampling; explainable AI; class imbalance; corporate transparency; sustainable corporate governance; sustainable development (SDG 16)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56
Date: 2025
Downloads: (external link)
https://www.mdpi.com/2071-1050/17/19/8933/pdf (application/pdf)
https://www.mdpi.com/2071-1050/17/19/8933/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:17:y:2025:i:19:p:8933-:d:1766952
Sustainability is currently edited by Ms. Alexandra Wu