ROC Curve Analysis in the Presence of Imperfect Reference Standards
Peizhou Liao, Hao Wu and Tianwei Yu
Additional contact information
Peizhou Liao: Emory University
Hao Wu: Emory University
Tianwei Yu: Emory University
Statistics in Biosciences, 2017, vol. 9, issue 1, No 6, 104 pages
Abstract:
The receiver operating characteristic (ROC) curve is an important tool for evaluating and comparing predictive models when the outcome is binary. If the class membership of every outcome is known, an ROC curve can be constructed for each model, and the curve with the greater area under it indicates better performance. In practice, however, reference standards are often imperfect: the class membership of some data points is not fully determined. This situation is especially prevalent in high-throughput biomedical data, because obtaining a perfect reference standard for all data points is either too costly or technically impractical. To construct ROC curves for such data, the common practice is either to ignore the uncertainty in the reference or to remove data points with high uncertainty. Both approaches may bias the ROC curves and produce misleading results in method evaluation. Here we present a framework that incorporates membership uncertainty into the construction of the ROC curve, termed the expected ROC or “eROC” curve. We develop an efficient procedure for estimating the eROC curve. The advantages of the eROC curve are demonstrated using simulated and real data.
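To make the idea of folding membership uncertainty into an ROC curve concrete, the following is a minimal sketch and not the estimation procedure from the article, which this record does not describe. It assumes each data point comes with a probability of belonging to the positive class (the hypothetical argument p_pos) and simply counts that point as p_pos of a positive and 1 - p_pos of a negative at every threshold. The function names expected_roc and auc are illustrative only.

```python
import numpy as np


def expected_roc(scores, p_pos):
    """Soft-label ROC sketch (an assumption, not the article's estimator).

    scores : model scores, higher means more likely positive.
    p_pos  : assumed probability that each point is truly positive.
    Requires sum(p_pos) > 0 and sum(1 - p_pos) > 0.
    """
    scores = np.asarray(scores, dtype=float)
    p_pos = np.asarray(p_pos, dtype=float)

    # Sort by decreasing score so a threshold sweep becomes a cumulative sum.
    order = np.argsort(-scores, kind="mergesort")
    s = scores[order]
    p = p_pos[order]

    # Expected true positives and false positives above each cut.
    tp = np.cumsum(p)
    fp = np.cumsum(1.0 - p)

    # Keep one point per distinct score (the last index of each tied group).
    last = np.r_[np.flatnonzero(np.diff(s)), s.size - 1]

    tpr = np.r_[0.0, tp[last] / tp[-1]]
    fpr = np.r_[0.0, fp[last] / fp[-1]]
    return fpr, tpr


def auc(fpr, tpr):
    """Area under the curve by the trapezoidal rule."""
    return float(np.sum(np.diff(fpr) * (tpr[1:] + tpr[:-1]) / 2.0))
```

When every entry of p_pos is exactly 0 or 1, this sketch reduces to the ordinary empirical ROC curve, which is one way to sanity-check it. When the reference carries no information at all (every p_pos equal to 0.5), the curve falls on the diagonal.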
Keywords: ROC curve; High-throughput data; Imperfect reference standards
Date: 2017
Downloads: (external link)
http://link.springer.com/10.1007/s12561-016-9159-7 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Persistent link: https://EconPapers.repec.org/RePEc:spr:stabio:v:9:y:2017:i:1:d:10.1007_s12561-016-9159-7
Ordering information: This journal article can be ordered from
http://www.springer.com/journal/12561
DOI: 10.1007/s12561-016-9159-7
Statistics in Biosciences is currently edited by Hongyu Zhao and Xihong Lin
More articles in Statistics in Biosciences from Springer, International Chinese Statistical Association
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.