An Empirical Bayes risk prediction model using multiple traits for sequencing data

Li Gengxin, Cui Yuehua and Zhao Hongyu
Additional contact information
Li Gengxin: Department of Mathematics and Statistics, Wright State University, 3640 Colonel Glenn Hwy, Dayton, OH 45435, USA
Cui Yuehua: Department of Statistics and Probability, Michigan State University, 619 Red Cedar Rd, East Lansing, MI 48824, USA
Zhao Hongyu: Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven, CT 06520, USA

Statistical Applications in Genetics and Molecular Biology, 2015, vol. 14, issue 6, 551-573

Abstract: Rapidly developing sequencing technologies have led to improved disease risk prediction by identifying many novel genes. Many prediction methods have been proposed that use rich genomic information to predict binary disease outcomes. Intuitively, these methods can be further improved by making efficient use of the rich information in measured quantitative traits that are correlated with the binary outcomes. In this article, we propose a novel Empirical Bayes prediction model that uses information from both quantitative traits and binary disease status to improve risk prediction. Our method is built on a new statistic that better infers the gene effect on multiple traits, and it also enjoys good theoretical properties. We then consider sequencing data, combining information from multiple rare variants in individual genes to strengthen the signals of causal genetic effects. In simulation studies, we find that our proposed Empirical Bayes approach is superior to existing methods in terms of feature selection and risk prediction. We further evaluate the effectiveness of our proposed method by applying it to the sequencing data provided by the Genetic Analysis Workshop 18.

Keywords: area under the ROC curve (AUC); cross validation (CV); Empirical Bayes (EB) estimate; multiple traits; receiver operating characteristic curve (ROC)
Date: 2015
Downloads: (external link)
https://doi.org/10.1515/sagmb-2015-0060 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.

Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:14:y:2015:i:6:p:551-573:n:4

Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/sagmb/html

DOI: 10.1515/sagmb-2015-0060

Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf

More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla.

 
Page updated 2025-03-19
Handle: RePEc:bpj:sagmbi:v:14:y:2015:i:6:p:551-573:n:4