EconPapers    
Economics at your fingertips  
 

Machine learning-based variable selection for clustered credit risk modeling

Joost Bosker (), Marc Gürtler () and Marvin Zöllner ()
Additional contact information
Joost Bosker: University of Braunschweig–Institute of Technology
Marc Gürtler: University of Braunschweig–Institute of Technology
Marvin Zöllner: University of Braunschweig–Institute of Technology

Journal of Business Economics, 2025, vol. 95, issue 4, No 5, 617-652

Abstract: Abstract Several studies have demonstrated the high prediction accuracy of clustered credit risk modeling. In clustered modeling, borrowers are segmented based on their similarities through cluster analysis, and a separate predictive model is developed for each cluster, resulting in increased predictive accuracy. Unambiguously, its effectiveness depends on the quality of the segmentation, which in turn depends primarily on the choice of variables used in the cluster analysis. However, appropriate variable selection for clustering is a major challenge, particularly for high-dimensional data. In the present study, we propose a machine learning-based variable selection method based on theoretical and regulatory considerations. Formally, the most influential risk drivers from a best-in-class machine learning model are identified using Shapley values and employed as clustering variables. Thus, the information of the explanatory variables crucial for the prediction of the dependent variable is already processed during data segmentation, making each individual predictive model more effective. Through a comparative analysis using two real-world credit default datasets, we show that our proposed approach to clustered modeling leads to the highest prediction accuracy among various clustering models.

Keywords: Credit risk; Forecasting; Clustering; Machine learning; Global credit data (search for similar items in EconPapers)
JEL-codes: C38 C45 C52 C53 G21 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s11573-024-01213-8 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:jbecon:v:95:y:2025:i:4:d:10.1007_s11573-024-01213-8

Ordering information: This journal article can be ordered from
http://www.springer.com/journal/11573

DOI: 10.1007/s11573-024-01213-8

Access Statistics for this article

Journal of Business Economics is currently edited by Günter Fandel

More articles in Journal of Business Economics from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-06-19
Handle: RePEc:spr:jbecon:v:95:y:2025:i:4:d:10.1007_s11573-024-01213-8