Ensemble classifiers for bankruptcy prediction using SMOTE and RFECV
T. Shahana,
Vilvanathan Lavanya and
Aamir Rashid Bhat
International Journal of Enterprise Network Management, 2024, vol. 15, issue 1, 109-132
Abstract:
This research investigates the impact of preprocessing strategies, namely feature selection (utilising correlation and recursive feature elimination with cross-validation) and class imbalance handling (employing synthetic minority oversampling technique), on the performance of prediction models using ensemble-learning techniques (random forest, AdaBoost, gradient boosting decision tree, extreme gradient boosting, bagging, LightGBM and extra tree classifier). The study focuses on the Polish bankruptcy dataset to assess the effectiveness of these preprocessing approaches. Experimental results demonstrate that adopting class imbalance handling significantly influences classifier performance compared to feature selection alone. Interestingly, hyperparameter tuning and feature selection exhibit limited impact on classifier performance. Among the ensemble-learning techniques tested, the adaptive boosting classifier shows consistently poor performance throughout the study period, followed by the bagging classifier with statistical significance. These findings shed light on the importance of selecting appropriate preprocessing strategies to improve the performance of ensemble-based prediction models in bankruptcy prediction tasks.
Keywords: bankruptcy prediction; ensemble classifiers; missing value imputation; SMOTE; correlation; RFECV. (search for similar items in EconPapers)
Date: 2024
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.inderscience.com/link.php?id=137456 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ids:ijenma:v:15:y:2024:i:1:p:109-132
Access Statistics for this article
More articles in International Journal of Enterprise Network Management from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker ().