Quicksort leave-pair-out cross-validation for ROC curve analysis

Riikka Numminen, Ileana Montoya Perez, Ivan Jambor, Tapio Pahikkala and Antti Airola
Additional contact information
Riikka Numminen: University of Turku
Ileana Montoya Perez: University of Turku
Ivan Jambor: University of Turku
Tapio Pahikkala: University of Turku
Antti Airola: University of Turku

Computational Statistics, 2023, vol. 38, issue 3, No 21, 1579-1595

Abstract: Receiver Operating Characteristic (ROC) curve analysis and the area under the ROC curve (AUC) are commonly used performance measures in diagnostic systems. In this work, we assume a setting where a classifier is inferred from multivariate data to predict the diagnostic outcome for new cases. Cross-validation is a resampling method for estimating the prediction performance of a classifier on data not used for inferring it. Tournament leave-pair-out cross-validation (TLPOCV) has been shown to be better than other resampling methods at producing a ranking of data that can be used for estimating ROC curves and the areas under them. However, the time complexity of TLPOCV, $O(n^2)$, means that it is impractical in many applications. In this article, a method called quicksort leave-pair-out cross-validation (QLPOCV) is presented in order to decrease the time complexity of obtaining a reliable ranking of data to $O(n \log n)$. The proposed method is compared with existing ones in an experimental study, demonstrating that, in terms of ROC curves and AUC values, QLPOCV produces performance estimates as accurate as those of TLPOCV, outperforming both k-fold and leave-one-out cross-validation.
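The abstract does not spell out the procedure, so the following is only an illustrative sketch of one plausible reading of the name: a quicksort whose comparison step trains a model with the two compared samples held out and then asks which of them scores higher. The toy nearest-centroid scorer, the function names, and the AUC helper are assumptions made for this example and are not taken from the article. Each comparison trains one model, so the expected number of trained models follows quicksort's expected $O(n \log n)$ comparison count, versus one model per pair in an all-pairs scheme.

```python
import random


def centroid_score_fn(X, y, held_out):
    """Toy scorer (an assumption, not the article's classifier):
    nearest-centroid model trained on every sample except `held_out`."""
    train = [k for k in range(len(X)) if k not in held_out]
    pos = [X[k] for k in train if y[k] == 1]
    neg = [X[k] for k in train if y[k] == 0]
    dim = len(X[0])
    mu_pos = [sum(v[d] for v in pos) / len(pos) for d in range(dim)]
    mu_neg = [sum(v[d] for v in neg) / len(neg) for d in range(dim)]

    def score(x):
        d_pos = sum((a - b) ** 2 for a, b in zip(x, mu_pos))
        d_neg = sum((a - b) ** 2 for a, b in zip(x, mu_neg))
        return d_neg - d_pos  # larger means "looks more positive"

    return score


def ranked_below(X, y, i, j):
    """Leave-pair-out comparison: train without i and j, then compare them."""
    score = centroid_score_fn(X, y, {i, j})
    return score(X[i]) < score(X[j])


def quicksort_lpo_ranking(X, y, indices, rng):
    """Return `indices` ordered from lowest to highest predicted rank."""
    if len(indices) <= 1:
        return list(indices)
    pivot = indices[rng.randrange(len(indices))]
    lower, upper = [], []
    for i in indices:
        if i == pivot:
            continue
        (lower if ranked_below(X, y, i, pivot) else upper).append(i)
    return (quicksort_lpo_ranking(X, y, lower, rng)
            + [pivot]
            + quicksort_lpo_ranking(X, y, upper, rng))


def auc_from_ranking(ranking, y):
    """Fraction of positive-negative pairs where the positive ranks higher."""
    position = {idx: r for r, idx in enumerate(ranking)}
    pos = [i for i in ranking if y[i] == 1]
    neg = [i for i in ranking if y[i] == 0]
    wins = sum(position[p] > position[q] for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical one-dimensional data with two well-separated classes.
X = [[0.0], [0.1], [0.2], [0.3], [1.0], [1.1], [1.2], [1.3]]
y = [0, 0, 0, 0, 1, 1, 1, 1]
ranking = quicksort_lpo_ranking(X, y, list(range(len(X))), random.Random(0))
print(ranking, auc_from_ranking(ranking, y))
```

The sketch needs at least three samples per class so that both centroids remain defined after a pair is held out. The resulting ranking can also be swept threshold by threshold to trace an ROC curve; the AUC helper above is just the pairwise summary of that curve.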

Keywords: Cross-validation; Leave-pair-out; Quicksort; Receiver operating characteristic analysis
Date: 2023

Downloads: (external link)
http://link.springer.com/10.1007/s00180-022-01288-3 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Persistent link: https://EconPapers.repec.org/RePEc:spr:compst:v:38:y:2023:i:3:d:10.1007_s00180-022-01288-3

Ordering information: This journal article can be ordered from
http://www.springer.com/statistics/journal/180/PS2

DOI: 10.1007/s00180-022-01288-3

Computational Statistics is currently edited by Wataru Sakamoto, Ricardo Cao and Jürgen Symanzik

Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.

 
Page updated 2025-03-20
Handle: RePEc:spr:compst:v:38:y:2023:i:3:d:10.1007_s00180-022-01288-3