EconPapers    
Economics at your fingertips  
 

Performance of evaluation metrics for classification in imbalanced data

Alex Cruz Huayanay (), Jorge L. Bazán () and Cibele M. Russo ()
Additional contact information
Alex Cruz Huayanay: Pontificia Universidad Católica del Perú
Jorge L. Bazán: University of São Paulo
Cibele M. Russo: University of São Paulo

Computational Statistics, 2025, vol. 40, issue 3, No 12, 1447-1473

Abstract: Abstract This paper investigates the effectiveness of various metrics for selecting the adequate model for binary classification when data is imbalanced. Through an extensive simulation study involving 12 commonly used metrics of classification, our findings indicate that the Matthews Correlation Coefficient, G-Mean, and Cohen’s kappa consistently yield favorable performance. Conversely, the area under the curve and Accuracy metrics demonstrate poor performance across all studied scenarios, while other seven metrics exhibit varying degrees of effectiveness in specific scenarios. Furthermore, we discuss a practical application in the financial area, which confirms the robust performance of these metrics in facilitating model selection among alternative link functions.

Keywords: Asymmetric links; Bayesian estimation; Binary classification; Imbalance; Metrics for classification (search for similar items in EconPapers)
Date: 2025
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s00180-024-01539-5 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:compst:v:40:y:2025:i:3:d:10.1007_s00180-024-01539-5

Ordering information: This journal article can be ordered from
http://www.springer.com/statistics/journal/180/PS2

DOI: 10.1007/s00180-024-01539-5

Access Statistics for this article

Computational Statistics is currently edited by Wataru Sakamoto, Ricardo Cao and Jürgen Symanzik

More articles in Computational Statistics from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-04-12
Handle: RePEc:spr:compst:v:40:y:2025:i:3:d:10.1007_s00180-024-01539-5