Sparse pathway-based prediction models for high-throughput molecular data
Sangin Lee,
Youngjo Lee and
Yudi Pawitan
Computational Statistics & Data Analysis, 2018, vol. 126, issue C, 125-135
Abstract:
Pathway-based prediction problems for high-throughput molecular data motivate the development of sparsity-constrained models with structured predictive variables. Intuitively it is desirable to incorporate the structural information into the model building procedure, potentially for improving both interpretability and prediction performances. Various random-effect models are developed for structured sparse prediction where the predictive variables/genes can be naturally grouped into overlapping groups or pathways. The hierarchical likelihood approach can be used for these random-effect models that impose sparse selection of the overlapping groups as well as further selection within the selected groups. In addition, the approach leads to a unified optimization algorithm for these random-effect models. Extensive numerical studies based on simulated and real breast-cancer data demonstrate that the proposed methods perform well against existing methods that ignore the structural information.
Keywords: Hierarchical likelihood; Overlapping groups; Random-effect model; Structured variable selection (search for similar items in EconPapers)
Date: 2018
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0167947318301014
Full text for ScienceDirect subscribers only.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:csdana:v:126:y:2018:i:c:p:125-135
DOI: 10.1016/j.csda.2018.04.012
Access Statistics for this article
Computational Statistics & Data Analysis is currently edited by S.P. Azen
More articles in Computational Statistics & Data Analysis from Elsevier
Bibliographic data for series maintained by Catherine Liu ().