Evaluating the informativeness of deep learning annotations for human complex diseases
Kushal K. Dey (),
Bryce Geijn,
Samuel Sungil Kim,
Farhad Hormozdiari,
David R. Kelley and
Alkes L. Price ()
Additional contact information
Kushal K. Dey: Harvard T. H. Chan School of Public Health
Bryce Geijn: Harvard T. H. Chan School of Public Health
Samuel Sungil Kim: Harvard T. H. Chan School of Public Health
Farhad Hormozdiari: Harvard T. H. Chan School of Public Health
David R. Kelley: Calico Labs
Alkes L. Price: Harvard T. H. Chan School of Public Health
Nature Communications, 2020, vol. 11, issue 1, 1-9
Abstract:
Abstract Deep learning models have shown great promise in predicting regulatory effects from DNA sequence, but their informativeness for human complex diseases is not fully understood. Here, we evaluate genome-wide SNP annotations from two previous deep learning models, DeepSEA and Basenji, by applying stratified LD score regression to 41 diseases and traits (average N = 320K), conditioning on a broad set of coding, conserved and regulatory annotations. We aggregated annotations across all (respectively blood or brain) tissues/cell-types in meta-analyses across all (respectively 11 blood or 8 brain) traits. The annotations were highly enriched for disease heritability, but produced only limited conditionally significant results: non-tissue-specific and brain-specific Basenji-H3K4me3 for all traits and brain traits respectively. We conclude that deep learning models have yet to achieve their full potential to provide considerable unique information for complex disease, and that their conditional informativeness for disease cannot be inferred from their accuracy in predicting regulatory annotations.
Date: 2020
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.nature.com/articles/s41467-020-18515-4 Abstract (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:11:y:2020:i:1:d:10.1038_s41467-020-18515-4
Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/
DOI: 10.1038/s41467-020-18515-4
Access Statistics for this article
Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie
More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().