A probabilistic graphical model for estimating selection coefficients of nonsynonymous variants from human population sequence data
Yige Zhao,
Tian Lan,
Guojie Zhong,
Jake Hagen,
Hongbing Pan,
Wendy K. Chung and
Yufeng Shen ()
Additional contact information
Yige Zhao: Columbia University Irving Medical Center
Tian Lan: Columbia University Irving Medical Center
Guojie Zhong: Columbia University Irving Medical Center
Jake Hagen: Columbia University Irving Medical Center
Hongbing Pan: Columbia University Irving Medical Center
Wendy K. Chung: Boston Children’s Hospital and Harvard Medical School
Yufeng Shen: Columbia University Irving Medical Center
Nature Communications, 2025, vol. 16, issue 1, 1-12
Abstract:
Abstract Accurately predicting the effect of missense variants is important in discovering disease risk genes and clinical genetic diagnostics. Commonly used computational methods predict pathogenicity, which does not capture the quantitative impact on fitness in humans. We develop a method, MisFit, to estimate missense fitness effect using a graphical model. MisFit jointly models the effect at a molecular level ( $$d$$ d ) and a population level (selection coefficient, $$s$$ s ), assuming that in the same gene, missense variants with similar $$d$$ d have similar $$s$$ s . We train it by maximizing probability of observed allele counts in 236,017 individuals of European ancestry. We show that $$s$$ s is informative in predicting allele frequency across ancestries and consistent with the fraction of de novo mutations in sites under strong selection. Further, $$s$$ s outperforms previous methods in prioritizing de novo missense variants in individuals with neurodevelopmental disorders. In conclusion, MisFit accurately predicts $$s$$ s and yields new insights from genomic data.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.nature.com/articles/s41467-025-59937-2 Abstract (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-59937-2
Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/
DOI: 10.1038/s41467-025-59937-2
Access Statistics for this article
Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie
More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().