Binary disease prediction using tail quantiles of the distribution of continuous biomarkers
Michiel H. J. Paus, Edwin R. van den Heuvel and Marc J. M. Meddens
Journal of Nonparametric Statistics, 2023, vol. 35, issue 1, 56-87
Abstract:
In binary disease classification, numerous techniques exist, but most work well only when biomarkers differ in mean between cases and controls. Biological processes are, however, much more heterogeneous, and differences can also occur in other distributional characteristics (e.g. variance, skewness). Many machine learning techniques are better able to exploit these higher-order distributional differences, sometimes at the cost of explainability. In this study, we propose quantile-based prediction (QBP), a binary classification method that selects multiple continuous biomarkers and uses the tail differences between the biomarker distributions of cases and controls. The performance of QBP is compared with supervised learning methods in extensive simulation studies and in two case studies: major depressive disorder (MDD) and trisomy. QBP outperformed the alternative methods when biomarkers predominantly show variance differences between cases and controls, especially in the MDD case study. More research is needed to further optimise QBP.
Date: 2023
Downloads: (external link)
http://hdl.handle.net/10.1080/10485252.2022.2141738 (text/html)
Access to full text is restricted to subscribers.
Persistent link: https://EconPapers.repec.org/RePEc:taf:gnstxx:v:35:y:2023:i:1:p:56-87
DOI: 10.1080/10485252.2022.2141738
Journal of Nonparametric Statistics is currently edited by Jun Shao