Binacox: automatic cut‐point detection in high‐dimensional Cox model with applications in genetics
Simon Bussy,
Mokhtar Z. Alaya,
Anne‐Sophie Jannot and
Agathe Guilloux
Biometrics, 2022, vol. 78, issue 4, 1414-1426
Abstract:
We introduce binacox, a prognostic method to deal with the problem of detecting multiple cut‐points per feature in a multivariate setting where a large number of continuous features are available. The method is based on the Cox model and combines one‐hot encoding with the binarsity penalty, which uses total‐variation regularization together with an extra linear constraint, and enables feature selection. Original nonasymptotic oracle inequalities for prediction (in terms of Kullback–Leibler divergence) and estimation with a fast rate of convergence are established. The statistical performance of the method is examined in an extensive Monte Carlo simulation study, and then illustrated on three publicly available genetic cancer data sets. On these high‐dimensional data sets, our proposed method outperforms state‐of‐the‐art survival models regarding risk prediction in terms of the C‐index, with a computing time orders of magnitude faster. In addition, it provides powerful interpretability from a clinical perspective by automatically pinpointing significant cut‐points in relevant variables.
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1111/biom.13547
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bla:biomet:v:78:y:2022:i:4:p:1414-1426
Ordering information: This journal article can be ordered from
http://www.blackwell ... bs.asp?ref=0006-341X
Access Statistics for this article
More articles in Biometrics from The International Biometric Society
Bibliographic data for series maintained by Wiley Content Delivery ().