Statistical framework for calling allelic imbalance in high-throughput sequencing data
Andrey Buyan,
Georgy Meshcheryakov,
Viacheslav Safronov,
Sergey Abramov,
Alexandr Boytsov,
Vladimir Nozdrin,
Eugene F. Baulin,
Semyon Kolmykov,
Jeff Vierstra,
Fedor Kolpakov,
Vsevolod J. Makeev () and
Ivan V. Kulakovskiy ()
Additional contact information
Andrey Buyan: Russian Academy of Sciences
Georgy Meshcheryakov: Russian Academy of Sciences
Viacheslav Safronov: Lomonosov Moscow State University
Sergey Abramov: Russian Academy of Sciences
Alexandr Boytsov: Russian Academy of Sciences
Vladimir Nozdrin: Lomonosov Moscow State University
Eugene F. Baulin: Moscow Center for Advanced Studies
Semyon Kolmykov: Sirius University of Science and Technology
Jeff Vierstra: Altius Institute for Biomedical Sciences
Fedor Kolpakov: Sirius University of Science and Technology
Vsevolod J. Makeev: Russian Academy of Sciences
Ivan V. Kulakovskiy: Russian Academy of Sciences
Nature Communications, 2025, vol. 16, issue 1, 1-19
Abstract:
Abstract High-throughput sequencing facilitates large-scale studies of gene regulation and allows tracing the associations of individual genomic variants with changes in gene regulation and expression. Compared to classic association studies, the assessment of an allelic imbalance at heterozygous variants captures functional variant effects with smaller sample sizes, higher sensitivity, and better resolution. Yet, identification of allele-specific variants from allelic read counts remains challenging due to data-dependent biases and overdispersion arising from technical and biological variability. We present MIXALIME, a novel computational framework for calling allele-specific variants in diverse omics data with a repertoire of statistical models accounting for read mapping bias and copy number variation. We benchmark MIXALIME with DNase-Seq, ATAC-Seq, and CAGE-Seq data, and we demonstrate that the allelic imbalance highlights causal variants in GWAS results. Finally, as a showcase of the large-scale practical application of MIXALIME, we present an atlas of variants exhibiting allele-specific chromatin accessibility, built from thousands of available datasets obtained from diverse cell types.
Date: 2025
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.nature.com/articles/s41467-024-55513-2 Abstract (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-024-55513-2
Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/
DOI: 10.1038/s41467-024-55513-2
Access Statistics for this article
Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie
More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().