A new machine learning approach to optimize correlated biomarkers
Ya-Hsun Lee,
Yi-Hau Chen and
Chao-Yu Guo
Communications in Statistics - Theory and Methods, 2025, vol. 54, issue 23, 7515-7526
Abstract:
The number of novel biomarkers is booming. However, a simple predictive score is more feasible to evaluate the clinical outcome and provide better accuracy. However, the optimal linear combination of correlated biomarkers demands comprehensive methodological research. This research aims to develop a novel approach for interpretable optimization. This research proposes the gradient boost machine with the Youden Index (GBYI) as the target function. The rationale is that the gradient boost machine demonstrates superior prediction ability and provides excellent interpretations according to the linear model. In addition, the Youden Index could effortlessly estimate the optimal cutoff point of the diagnostic test and evaluate the overall accuracy. Simulation studies evaluate the performance of the GBYI with linear and nonlinear structured datasets. We also demonstrate an application in the Bupa Liver Disease Data, which revealed that our optimal combination of correlated biomarkers shows an improved prediction with higher accuracy. This research proposes a novel machine-learning strategy using the powerful statistical boosting technique of the Youden Index. The new machine could optimize the combination of high-dimensional data and provide attractive interpretable coefficients.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://hdl.handle.net/10.1080/03610926.2025.2477289 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:lstaxx:v:54:y:2025:i:23:p:7515-7526
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/lsta20
DOI: 10.1080/03610926.2025.2477289
Access Statistics for this article
Communications in Statistics - Theory and Methods is currently edited by Debbie Iscoe
More articles in Communications in Statistics - Theory and Methods from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().