Enhancing default prediction in alternative lending: leveraging credit bureau data and machine learning
Zilong Liu and Hongyan Liang
Journal of Risk Model Validation
Abstract:
Alternative lending is a vital source of credit for consumers underserved by traditional banks. This study examines how integrating additional data and advanced machine learning enhances default prediction in this sector. We merge loan records with credit bureau data and compare four variable sets: credit scores alone; loan-specific variables alone; a combination of credit scores and loan variables; and an integration of credit scores, loan variables and more than 300 credit bureau variables selected via least absolute shrinkage and selection operator (Lasso) regression. Our findings show that credit scores alone yield limited accuracy, with an area under the curve (AUC) of 0.6, while incorporating loan-specific features significantly improves performance. Further including selected credit bureau variables and tuning hyperparameters boosts predictive power, with a random forest model achieving an AUC of 0.854. Key predictors include credit scores, the loan amount, loan duration, months since the oldest trade, and recent credit inquiries. These results underscore the importance of comprehensive credit bureau data and rigorous model validation in alternative lending, offering practical insights for lenders and policy makers seeking to refine credit risk assessment.
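To make the AUC figures quoted in the abstract easier to read, the following minimal sketch computes AUC as the share of (defaulted, repaid) loan pairs in which the defaulted loan receives the higher risk score. This pairwise form is a standard equivalent definition of the area under the ROC curve; it is not taken from the paper, and the function name `auc`, the labels and both score lists are hypothetical toy values chosen only for illustration, not the authors' data or results.

```python
from typing import Sequence


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise comparison.

    labels: 1 = loan defaulted, 0 = loan repaid.
    scores: predicted default risk (higher means riskier).
    A tie between a defaulted and a repaid loan counts as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one defaulted and one repaid loan")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data (assumption): 3 defaulted loans, 5 repaid loans.
labels = [1, 1, 1, 0, 0, 0, 0, 0]
# A weak scorer that ranks only 9 of the 15 pairs correctly.
weak_scores = [0.50, 0.30, 0.70, 0.60, 0.40, 0.20, 0.55, 0.35]
# A stronger scorer that ranks 14 of the 15 pairs correctly.
strong_scores = [0.80, 0.45, 0.90, 0.50, 0.30, 0.10, 0.40, 0.20]

print(f"{auc(labels, weak_scores):.3f}")    # 0.600
print(f"{auc(labels, strong_scores):.3f}")  # 0.933
```

Read this way, an AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation of defaulted from repaid loans. The quadratic loop is fine for a toy example; on a real loan portfolio a rank-based implementation from an established library would be the more practical choice.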
Downloads: https://www.risk.net/node/7961668
Persistent link: https://EconPapers.repec.org/RePEc:rsk:journ5:7961668