Estimating credit risk parameters using ensemble learning methods: an empirical study on loss given default
Han Sheng Sun and
Zi Jin
Journal of Credit Risk
Abstract:
ABSTRACT In credit risk modeling, banks and insurance companies routinely use a single model for estimating key risk parameters. Combining several models to make a final prediction is not often considered. Using an ensemble or a collection of models rather than a single model can improve the accuracy and robustness of prediction results. In this study, we investigate two well-established ensemble learning methods (stochastic gradient boosting and random forest) and propose two new ensembles (ensemble by partial least squares and bag-boosting) in the application of predicting the loss given default. We demonstrate that an ensemble approach significantly increases the discriminatory power of the model compared with a single decision tree. In addition, the ensemble learning methods can be applied directly to predicting the exposure at default and probability of default with some simple modifications. The proposed approaches introduce a novel modeling framework that banks and other financial institutions can use to estimate and validate credit risk parameters based on the internal data of different portfolios. Moreover, the proposed approaches can be readily extended to general portfolio risk modeling in the areas of regulatory capital and economic capital management, loss forecasting, stress testing and pre-provision net revenue projections. ;
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.risk.net/journal-of-credit-risk/246658 ... n-loss-given-default (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:rsk:journ1:2466583
Access Statistics for this article
More articles in Journal of Credit Risk from Journal of Credit Risk
Bibliographic data for series maintained by Thomas Paine ().