Leveraging effect size distributions to improve polygenic risk scores derived from summary statistics of genome-wide association studies
Shuang Song,
Wei Jiang,
Lin Hou and
Hongyu Zhao
PLOS Computational Biology, 2020, vol. 16, issue 2, 1-18
Abstract:
Genetic risk prediction is an important problem in human genetics, and accurate prediction can facilitate disease prevention and treatment. Calculating polygenic risk score (PRS) has become widely used due to its simplicity and effectiveness, where only summary statistics from genome-wide association studies are needed in the standard method. Recently, several methods have been proposed to improve standard PRS by utilizing external information, such as linkage disequilibrium and functional annotations. In this paper, we introduce EB-PRS, a novel method that leverages information for effect sizes across all the markers to improve prediction accuracy. Compared to most existing genetic risk prediction methods, our method does not need to tune parameters nor external information. Real data applications on six diseases, including asthma, breast cancer, celiac disease, Crohn’s disease, Parkinson’s disease and type 2 diabetes show that EB-PRS achieved 307.1%, 42.8%, 25.5%, 3.1%, 74.3% and 49.6% relative improvements in terms of predictive r2 over standard PRS method with optimally tuned parameters. Besides, compared to LDpred that makes use of LD information, EB-PRS also achieved 37.9%, 33.6%, 8.6%, 36.2%, 40.6% and 10.8% relative improvements. We note that our method is not the first method leveraging effect size distributions. Here we first justify our method by presenting theoretical optimal property over existing methods in this class of methods, and substantiate our theoretical result with extensive simulation results. The R-package EBPRS that implements our method is available on CRAN.Author summary: Genetic risk prediction is of considerable importance in human genetics. Calculating polygenic risk score (PRS) is the most commonly used approach due to its simplicity as well as the computational efficiency. Many PRS calculation methods have been proposed for accurate prediction by borrowing information from external panels or datasets. Here, instead of external information, we propose a novel PRS method leveraging internal information, namely the effect size distributions, to improve prediction accuracy. Neither parameter tuning nor external information is needed for our method. Extensive simulations and real data applications demonstrate that our method has substantial improvement over existing methods in terms of prediction accuracy.
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (3)
Downloads: (external link)
https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1007565 (text/html)
https://journals.plos.org/ploscompbiol/article/fil ... 07565&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pcbi00:1007565
DOI: 10.1371/journal.pcbi.1007565
Access Statistics for this article
More articles in PLOS Computational Biology from Public Library of Science
Bibliographic data for series maintained by ploscompbiol ().