A copula-based method of classifying individuals into binary disease categories using dependent biomarkers
Shofiqul Islam (),
Sonia Anand (),
Jemila Hamid (),
Lehana Thabane () and
Joseph Beyene ()
Additional contact information
Shofiqul Islam: McMaster University
Sonia Anand: McMaster University
Jemila Hamid: McMaster University
Lehana Thabane: McMaster University
Joseph Beyene: McMaster University
Statistical Methods & Applications, 2020, vol. 29, issue 4, No 9, 897 pages
Abstract:
Abstract Classification of a disease often depends on more than one test, and the tests can be interrelated. Under the incorrect assumption of independence, the test result using dependent biomarkers can lead to a conflicting disease classification. We develop a copula-based method for this purpose that takes dependency into account and leads to a unique decision. We first construct the joint probability distribution of the biomarkers considering Frank’s, Clayton’s and Gumbel’s copulas. We then develop the classification method and perform a comprehensive simulation. Using simulated data sets, we study the statistical properties of joint probability distributions and determine the joint threshold with maximum classification accuracy. Our simulation study results show that parameter estimates for the copula-based bivariate distributions are not biased. We observe that the thresholds for disease classification converge to a stationary distribution across different choices of copulas. We also observe that the classification accuracy decreases with the increasing value of the dependence parameter of the copulas. Finally, we illustrate our method with a real data example, where we identify the joint threshold of Apolipoprotein B to Apolipoprotein A1 ratio and total cholesterol to high-density lipoprotein ratio for the classification of myocardial infarction. We conclude, the copula-based method works well in identifying the joint threshold of two dependent biomarkers for an outcome classification. Our method is flexible and allows modeling broad classes of bivariate distributions that take dependency into account. The threshold may allow clinicians to classify uniquely individuals at risk of developing the disease and plan for early intervention.
Keywords: Copula; Gamma distribution; Biomarker; Sensitivity; Specificity; AUC (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s10260-020-00507-9 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:stmapp:v:29:y:2020:i:4:d:10.1007_s10260-020-00507-9
Ordering information: This journal article can be ordered from
http://www.springer. ... cs/journal/10260/PS2
DOI: 10.1007/s10260-020-00507-9
Access Statistics for this article
Statistical Methods & Applications is currently edited by Tommaso Proietti
More articles in Statistical Methods & Applications from Springer, Società Italiana di Statistica
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().