Accurate and scalable variant calling from single cell DNA sequencing data with ProSolo
David Lähnemann,
Johannes Köster,
Ute Fischer,
Arndt Borkhardt,
Alice C. McHardy () and
Alexander Schönhuth ()
Additional contact information
David Lähnemann: Helmholtz Centre for Infection Research
Johannes Köster: University of Duisburg-Essen
Ute Fischer: University Hospital, Medical Faculty, Heinrich Heine University Düsseldorf
Arndt Borkhardt: University Hospital, Medical Faculty, Heinrich Heine University Düsseldorf
Alice C. McHardy: Helmholtz Centre for Infection Research
Alexander Schönhuth: Centrum Wiskunde & Informatica
Nature Communications, 2021, vol. 12, issue 1, 1-11
Abstract:
Abstract Accurate single cell mutational profiles can reveal genomic cell-to-cell heterogeneity. However, sequencing libraries suitable for genotyping require whole genome amplification, which introduces allelic bias and copy errors. The resulting data violates assumptions of variant callers developed for bulk sequencing. Thus, only dedicated models accounting for amplification bias and errors can provide accurate calls. We present ProSolo for calling single nucleotide variants from multiple displacement amplified (MDA) single cell DNA sequencing data. ProSolo probabilistically models a single cell jointly with a bulk sequencing sample and integrates all relevant MDA biases in a site-specific and scalable—because computationally efficient—manner. This achieves a higher accuracy in calling and genotyping single nucleotide variants in single cells in comparison to state-of-the-art tools and supports imputation of insufficiently covered genotypes, when downstream tools cannot handle missing data. Moreover, ProSolo implements the first approach to control the false discovery rate reliably and flexibly. ProSolo is implemented in an extendable framework, with code and usage at: https://github.com/prosolo/prosolo
Date: 2021
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.nature.com/articles/s41467-021-26938-w Abstract (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-021-26938-w
Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/
DOI: 10.1038/s41467-021-26938-w
Access Statistics for this article
Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie
More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().