SME default prediction: A systematic methods evaluation
Hamid Cheraghali and
Peter Molnár
Journal of Small Business Management, 2025, vol. 63, issue 4, 1466-1517
Abstract:
This study evaluates the performance of various methodologies used in the literature to predict failures in small- and medium-sized enterprises (SMEs) using a data set of U.S. SMEs. By evaluating over 6,100 models and data subsample combinations, we find that the light gradient boosting machine (LightGBM) exhibits the best out-of-sample predictive performance, closely followed by extreme gradient boosting (XGBoost). These two estimation methods perform best using their built-in feature-selection mechanisms and do not require sample rebalancing. However, for most other estimation methods, feature selection and sample rebalancing are critical. For example, logistic regression (Logit) performs significantly better with appropriate feature selection and sample rebalancing. We also provide an overview of the importance of various features in predicting failures.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://hdl.handle.net/10.1080/00472778.2024.2390500 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:ujbmxx:v:63:y:2025:i:4:p:1466-1517
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/ujbm20
DOI: 10.1080/00472778.2024.2390500
Access Statistics for this article
Journal of Small Business Management is currently edited by Eric Liguori
More articles in Journal of Small Business Management from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().