Evaluation of Cost-Sensitive Learning Models in Forecasting Business Failure of Capital Market Firms
Pejman Peykani (),
Moslem Peymany Foroushany,
Cristina Tanasescu,
Mostafa Sargolzaei and
Hamidreza Kamyabfar
Additional contact information
Pejman Peykani: Department of Industrial Engineering, Faculty of Engineering, Khatam University, Tehran 1991633357, Iran
Moslem Peymany Foroushany: Department of Finance and Banking, Faculty of Management and Accounting, Allameh Tabataba’i University, Tehran 1489684511, Iran
Cristina Tanasescu: Faculty of Economic Sciences, Lucian Blaga University of Sibiu, 550324 Sibiu, Romania
Mostafa Sargolzaei: Department of Finance and Banking, Faculty of Management and Accounting, Allameh Tabataba’i University, Tehran 1489684511, Iran
Hamidreza Kamyabfar: Department of Finance and Banking, Faculty of Management and Accounting, Allameh Tabataba’i University, Tehran 1489684511, Iran
Mathematics, 2025, vol. 13, issue 3, 1-29
Abstract:
Classifying imbalanced data is a well-known challenge in machine learning. One of the fields inherently affected by imbalanced data is credit datasets in finance. In this study, to address this challenge, we employed one of the most recent methods developed for classifying imbalanced data, CorrOV-CSEn. In addition to the original CorrOV-CSEn approach, which uses AdaBoost as its base learning method, we also applied Multi-Layer Perceptron (MLP), random forest, gradient boosted trees, XGBoost, and CatBoost. Our dataset, sourced from the Iran capital market from 2015 to 2022, utilizes the more general and accurate term business failure instead of default. Model performance was evaluated using sensitivity, precision, and F1 score, while their overall performance was compared using the Friedman–Nemenyi test. The results indicate the high effectiveness of all models in identifying failing businesses (sensitivity), with CatBoost achieving a sensitivity of 0.909 on the test data. However, all models exhibited relatively low precision.
Keywords: business failure forecasting; imbalanced data; cost-sensitive learning; machine learning; Multi-Layer Perceptron (MLP); random forest; gradient boosted trees; XGBoost; CatBoost; AdaBoost (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2025
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-7390/13/3/368/pdf (application/pdf)
https://www.mdpi.com/2227-7390/13/3/368/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:13:y:2025:i:3:p:368-:d:1574687
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().