Ensemble survival tree models to reveal pairwise interactions of variables with time-to-events outcomes in low-dimensional setting
Dazard Jean-Eudes (),
Zimmerman Peter () and
Additional contact information
Dazard Jean-Eudes: Case Western Reserve University, School of Medicine, Center for Proteomics and Bioinformatics, Cleveland, OH 44106, USA
Ishwaran Hemant: The University of Miami, Department of Epidemiology and Public Health, Division of Biostatistics, Miami, FL 33136, USA
Zimmerman Peter: Case Western Reserve University, School of Medicine, Center for Global Health and Diseases, Cleveland, OH 44106, USA
Weinberg Aaron: Case Western Reserve University, School of Dental Medicine, Department of Biological Sciences, Cleveland, OH 44106, USA
Statistical Applications in Genetics and Molecular Biology, 2018, vol. 17, issue 1, 28
Unraveling interactions among variables such as genetic, clinical, demographic and environmental factors is essential to understand the development of common and complex diseases. To increase the power to detect such variables interactions associated with clinical time-to-events outcomes, we borrowed established concepts from random survival forest (RSF) models. We introduce a novel RSF-based pairwise interaction estimator and derive a randomization method with bootstrap confidence intervals for inferring interaction significance. Using various linear and nonlinear time-to-events survival models in simulation studies, we first show the efficiency of our approach: true pairwise interaction-effects between variables are uncovered, while they may not be accompanied with their corresponding main-effects, and may not be detected by standard semi-parametric regression modeling and test statistics used in survival analysis. Moreover, using a RSF-based cross-validation scheme for generating prediction estimators, we show that informative predictors may be inferred. We applied our approach to an HIV cohort study recording key host gene polymorphisms and their association with HIV change of tropism or AIDS progression. Altogether, this shows how linear or nonlinear pairwise statistical interactions of variables may be efficiently detected with a predictive value in observational studies with time-to-event outcomes.
Keywords: epistasis; genetic variations interactions; interaction detection and modeling; random survival forest; time-to-event analysis (search for similar items in EconPapers)
References: Add references at CitEc
Citations: Track citations by RSS feed
Downloads: (external link)
For access to full text, subscription to the journal or payment for the individual article is required.
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:17:y:2018:i:1:p:28:n:2
Ordering information: This journal article can be ordered from
Access Statistics for this article
Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf
More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla ().