Performance of evaluation metrics for classification in imbalanced data
Alex Cruz Huayanay (),
Jorge L. Bazán () and
Cibele M. Russo ()
Additional contact information
Alex Cruz Huayanay: Pontificia Universidad Católica del Perú
Jorge L. Bazán: University of São Paulo
Cibele M. Russo: University of São Paulo
Computational Statistics, 2025, vol. 40, issue 3, No 12, 1447-1473
Abstract:
Abstract This paper investigates the effectiveness of various metrics for selecting the adequate model for binary classification when data is imbalanced. Through an extensive simulation study involving 12 commonly used metrics of classification, our findings indicate that the Matthews Correlation Coefficient, G-Mean, and Cohen’s kappa consistently yield favorable performance. Conversely, the area under the curve and Accuracy metrics demonstrate poor performance across all studied scenarios, while other seven metrics exhibit varying degrees of effectiveness in specific scenarios. Furthermore, we discuss a practical application in the financial area, which confirms the robust performance of these metrics in facilitating model selection among alternative link functions.
Keywords: Asymmetric links; Bayesian estimation; Binary classification; Imbalance; Metrics for classification (search for similar items in EconPapers)
Date: 2025
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s00180-024-01539-5 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:compst:v:40:y:2025:i:3:d:10.1007_s00180-024-01539-5
Ordering information: This journal article can be ordered from
http://www.springer.com/statistics/journal/180/PS2
DOI: 10.1007/s00180-024-01539-5
Access Statistics for this article
Computational Statistics is currently edited by Wataru Sakamoto, Ricardo Cao and Jürgen Symanzik
More articles in Computational Statistics from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().