A comparative study of combining tree‐based feature selection methods and classifiers in personal loan default prediction
Weidong Guo and
Zach Zhizhong Zhou
Journal of Forecasting, 2022, vol. 41, issue 6, 1248-1313
Abstract:
Personal credit data usually contain a large number of features, some of which do not significantly contribute to the performance of default prediction models. Screening features through appropriate methods is essential to improve the efficiency of prediction models. However, little attention has been paid to feature selection methods in the area of personal loan default prediction. In this study, we employ random forest (RF), XGBoost, Adaptive Boosting (AdaBoost), Categorical Boosting (CatBoost), and Light Gradient Boosting Machine (LightGBM) as base algorithms of wrapper and embedded methods to select features and use these algorithms as classifiers to predict personal loan default. We find that when classical filter methods are used to select features, the number of selected features needs to be large enough to enable tree‐based classifiers to get their best performance. However, when the tree‐based algorithm is used to select features, it only needs to select a small number of features to deliver a satisfactory classification performance. AdaBoost, Chi2, and F‐score are found to be ideal feature selection methods in the area of personal credit default prediction. Moreover, we find that it is better to use different algorithms in feature selection and classification; AdaBoost and CatBoost perform the best among all classifiers.
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
https://doi.org/10.1002/for.2856
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:wly:jforec:v:41:y:2022:i:6:p:1248-1313
Access Statistics for this article
Journal of Forecasting is currently edited by Derek W. Bunn
More articles in Journal of Forecasting from John Wiley & Sons, Ltd.
Bibliographic data for series maintained by Wiley Content Delivery ().