Accuracy Comparison between Five Machine Learning Algorithms for Financial Risk Evaluation

Dong, Haokun; Liu, Rui; Tham, Allan W.

Accuracy Comparison between Five Machine Learning Algorithms for Financial Risk Evaluation

Haokun Dong, Rui Liu and Allan W. Tham ()
Additional contact information
Haokun Dong: Faculty of Science and Technology, University of Canberra, Canberra 2617, Australia
Rui Liu: Faculty of Science and Technology, University of Canberra, Canberra 2617, Australia
Allan W. Tham: Faculty of Science and Technology, University of Canberra, Canberra 2617, Australia

JRFM, 2024, vol. 17, issue 2, 1-19

Abstract: An accurate prediction of loan default is crucial in credit risk evaluation. A slight deviation from true accuracy can often cause financial losses to lending institutes. This study describes the non-parametric approach that compares five different machine learning classifiers combined with a focus on sufficiently large datasets. It presents the findings on various standard performance measures such as accuracy, precision, recall and F1 scores in addition to Receiver Operating Curve-Area Under Curve (ROC-AUC). In this study, various data pre-processing techniques including normalization and standardization, imputation of missing values and the handling of imbalanced data using SMOTE will be discussed and implemented. Also, the study examines the use of hyper-parameters in various classifiers. During the model construction phase, various pipelines feed data to the five machine learning classifiers, and the performance results obtained from the five machine learning classifiers are based on sampling with SMOTE or hyper-parameters versus without SMOTE and hyper-parameters. Each classifier is compared to another in terms of accuracy during training and prediction phase based on out-of-sample data. The 2 data sets used for this experiment contain 1000 and 30,000 observations, respectively, of which the training/testing ratio is 80:20. The comparative results show that random forest outperforms the other four classifiers both in training and actual prediction.

Keywords: financial data analysis; machine learning algorithms; loan default assessment; classification (search for similar items in EconPapers)
JEL-codes: C E F2 F3 G (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.mdpi.com/1911-8074/17/2/50/pdf (application/pdf)
https://www.mdpi.com/1911-8074/17/2/50/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jjrfmx:v:17:y:2024:i:2:p:50-:d:1328960

Access Statistics for this article

JRFM is currently edited by Ms. Chelthy Cheng

More articles in JRFM from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().