The effect of training set selection when predicting defaulting small and medium-sized enterprises with unbalanced data
Giovanna Menardi and Nicola Torelli
Journal of Credit Risk
Abstract:
ABSTRACT We focus on classification methods to separate defaulting small and medium sized enterprises from nondefaulting ones. In this framework, a typical problem occurs because the proportion of defaulting firms is very close to zero, leading to a class imbalance. Moreover, a form of bias may affect the classification because models are often estimated on samples of large corporations that are not randomly selected. We investigate how different criteria of sample selection may affect the accuracy of the classification and how this problem is strongly related to class imbalance.
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.risk.net/journal-of-credit-risk/231008 ... with-unbalanced-data (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:rsk:journ1:2310088
Access Statistics for this article
More articles in Journal of Credit Risk from Journal of Credit Risk
Bibliographic data for series maintained by Thomas Paine ().