Quicksort leave-pair-out cross-validation for ROC curve analysis
Riikka Numminen (),
Ileana Montoya Perez,
Ivan Jambor,
Tapio Pahikkala and
Antti Airola
Additional contact information
Riikka Numminen: University of Turku
Ileana Montoya Perez: University of Turku
Ivan Jambor: University of Turku
Tapio Pahikkala: University of Turku
Antti Airola: University of Turku
Computational Statistics, 2023, vol. 38, issue 3, No 21, 1579-1595
Abstract:
Abstract Receiver Operating Characteristic (ROC) curve analysis and area under the ROC curve (AUC) are commonly used performance measures in diagnostic systems. In this work, we assume a setting, where a classifier is inferred from multivariate data to predict the diagnostic outcome for new cases. Cross-validation is a resampling method for estimating the prediction performance of a classifier on data not used for inferring it. Tournament leave-pair-out (TLPO) cross-validation has been shown to be better than other resampling methods at producing a ranking of data that can be used for estimating the ROC curves and areas under them. However, the time complexity of TLPOCV, $$O\left( n^2\right)$$ O n 2 , means that it is impractical in many applications. In this article, a method called quicksort leave-pair-out cross-validation (QLPOCV) is presented in order to decrease the time complexity of obtaining a reliable ranking of data to $$O\left( n\log n\right)$$ O n log n . The proposed method is compared with existing ones in an experimental study, demonstrating that in terms of ROC curves and AUC values QLPOCV produces as accurate performance estimation as TLPOCV, outperforming both k-fold and leave-one-out cross-validation.
Keywords: Cross-validation; Leave-pair-out; Quicksort; Receiver operating characteristic analysis (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s00180-022-01288-3 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:compst:v:38:y:2023:i:3:d:10.1007_s00180-022-01288-3
Ordering information: This journal article can be ordered from
http://www.springer.com/statistics/journal/180/PS2
DOI: 10.1007/s00180-022-01288-3
Access Statistics for this article
Computational Statistics is currently edited by Wataru Sakamoto, Ricardo Cao and Jürgen Symanzik
More articles in Computational Statistics from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().