Credit scoring prediction leveraging interpretable ensemble learning
Yang Liu,
Fei Huang,
Lili Ma,
Qingguo Zeng and
Jiale Shi
Journal of Forecasting, 2024, vol. 43, issue 2, 286-308
Abstract:
Credit scoring models based on machine learning often need to work on accuracy and interpretability in practical applications. Original KCDWU has a more prominent adaptive property but ignores intra‐class and inter‐class distances in the clustering process, resulting in the possibility of inaccurate identification of class features and cluster structure of data, which compromises the clustering effect. Therefore, we improve the automatic K‐means clustering based on the Calinski–Harabasz index, thus achieving a clustering output for improved results. We also scrutinize representative five single classification models and six ensemble learning models for credit scoring prediction. We empirically test the superior performance of ensemble learning models and identify the best model CatBoost by comparing them based on multiple evaluation indicators. Empirical results reveal that the SHAP method conforms well to CatBoost and delivers a global and local interpretation of the predictions. This work provides financial institutions with a promising candidate for interpretable credit scoring models.
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1002/for.3033
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:wly:jforec:v:43:y:2024:i:2:p:286-308
Access Statistics for this article
Journal of Forecasting is currently edited by Derek W. Bunn
More articles in Journal of Forecasting from John Wiley & Sons, Ltd.
Bibliographic data for series maintained by Wiley Content Delivery ().