Ensemble survival tree models to reveal pairwise interactions of variables with time-to-events outcomes in low-dimensional setting
Dazard Jean-Eudes (),
Ishwaran Hemant,
Mehlotra Rajeev,
Weinberg Aaron and
Zimmerman Peter ()
Additional contact information
Dazard Jean-Eudes: Case Western Reserve University, School of Medicine, Center for Proteomics and Bioinformatics, Cleveland, OH 44106, USA
Ishwaran Hemant: The University of Miami, Department of Epidemiology and Public Health, Division of Biostatistics, Miami, FL 33136, USA
Mehlotra Rajeev: Case Western Reserve University, School of Medicine, Center for Global Health and Diseases, Cleveland, OH 44106, USA
Weinberg Aaron: Case Western Reserve University, School of Dental Medicine, Department of Biological Sciences, Cleveland, OH 44106, USA
Zimmerman Peter: Case Western Reserve University, School of Medicine, Center for Global Health and Diseases, Cleveland, OH 44106, USA
Statistical Applications in Genetics and Molecular Biology, 2018, vol. 17, issue 1, 28
Abstract:
Unraveling interactions among variables such as genetic, clinical, demographic and environmental factors is essential to understand the development of common and complex diseases. To increase the power to detect such variables interactions associated with clinical time-to-events outcomes, we borrowed established concepts from random survival forest (RSF) models. We introduce a novel RSF-based pairwise interaction estimator and derive a randomization method with bootstrap confidence intervals for inferring interaction significance. Using various linear and nonlinear time-to-events survival models in simulation studies, we first show the efficiency of our approach: true pairwise interaction-effects between variables are uncovered, while they may not be accompanied with their corresponding main-effects, and may not be detected by standard semi-parametric regression modeling and test statistics used in survival analysis. Moreover, using a RSF-based cross-validation scheme for generating prediction estimators, we show that informative predictors may be inferred. We applied our approach to an HIV cohort study recording key host gene polymorphisms and their association with HIV change of tropism or AIDS progression. Altogether, this shows how linear or nonlinear pairwise statistical interactions of variables may be efficiently detected with a predictive value in observational studies with time-to-event outcomes.
Keywords: epistasis; genetic variations interactions; interaction detection and modeling; random survival forest; time-to-event analysis (search for similar items in EconPapers)
Date: 2018
References: Add references at CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1515/sagmb-2017-0038 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:17:y:2018:i:1:p:28:n:1
Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/sagmb/html
DOI: 10.1515/sagmb-2017-0038
Access Statistics for this article
Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf
More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla ().