EconPapers    
Economics at your fingertips  
 

A Comprehensive Evaluation of Data Balancing Techniques and Machine Learning Models for Credit Risk Assessment

Safae Ndama (), Oussama Ndama () and El Mokhtar En-Naimi ()
Additional contact information
Safae Ndama: FST of Tangier, Abdelmalek Essaâdi University, DSAI2S Research Team, C3S Laboratory
Oussama Ndama: FST of Tangier, Abdelmalek Essaâdi University, DSAI2S Research Team, C3S Laboratory
El Mokhtar En-Naimi: FST of Tangier, Abdelmalek Essaâdi University, DSAI2S Research Team, C3S Laboratory

A chapter in Technological Innovations for Sustainable Development, 2025, pp 142-155 from Springer

Abstract: Abstract Credit risk assessment plays a critical role in ensuring responsible lending and reducing financial losses in the banking sector. With the growing availability of financial data, machine learning (ML) has emerged as a powerful tool for evaluating creditworthiness by uncovering hidden patterns. However, the highly imbalanced nature of credit datasets where non-default cases vastly outnumber defaults poses a significant challenge to predictive accuracy. To address this, our study examines and compares six resampling techniques: Synthetic Minority Over-sampling Technique (SMOTE), Random Over Sampling, Random Under Sampling, Adaptive Synthetic Sampling (ADASYN), Tomek Links, and a hybrid SMOTE-RUS approach. These methods are applied across various ML models, including Decision Tree, Random Forest, XGBoost, Neural Network, AdaBoost, CatBoost, and K-Nearest Neighbors. The hybrid SMOTE-RUS technique demonstrated the most balanced outcomes, enhancing accuracy while reducing overfitting risks. SMOTE and ADASYN also showed notable improvements in detecting minority-class instances. This research offers a detailed comparative analysis of resampling strategies, helping to guide the development of more reliable and equitable credit scoring models for real-world applications.

Keywords: Credit Risk Assessment; Machine Learning in Finance; Class Imbalance; Data Balancing Techniques; SMOTE and ADASYN (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:lnichp:978-3-032-06725-8_12

Ordering information: This item can be ordered from
http://www.springer.com/9783032067258

DOI: 10.1007/978-3-032-06725-8_12

Access Statistics for this chapter

More chapters in Lecture Notes in Information Systems and Organization from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2026-06-17
Handle: RePEc:spr:lnichp:978-3-032-06725-8_12