Accurate, scalable and integrative haplotype estimation
Olivier Delaneau (),
Jean-François Zagury,
Matthew R. Robinson,
Jonathan L. Marchini and
Emmanouil T. Dermitzakis
Additional contact information
Olivier Delaneau: University of Lausanne, Génopode
Jean-François Zagury: HESAM Université
Matthew R. Robinson: University of Lausanne, Génopode
Jonathan L. Marchini: University of Oxford
Emmanouil T. Dermitzakis: University of Geneva Medical School
Nature Communications, 2019, vol. 10, issue 1, 1-10
Abstract:
Abstract The number of human genomes being genotyped or sequenced increases exponentially and efficient haplotype estimation methods able to handle this amount of data are now required. Here we present a method, SHAPEIT4, which substantially improves upon other methods to process large genotype and high coverage sequencing datasets. It notably exhibits sub-linear running times with sample size, provides highly accurate haplotypes and allows integrating external phasing information such as large reference panels of haplotypes, collections of pre-phased variants and long sequencing reads. We provide SHAPEIT4 in an open source format and demonstrate its performance in terms of accuracy and running times on two gold standard datasets: the UK Biobank data and the Genome In A Bottle.
Date: 2019
References: Add references at CitEc
Citations: View citations in EconPapers (15)
Downloads: (external link)
https://www.nature.com/articles/s41467-019-13225-y Abstract (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:10:y:2019:i:1:d:10.1038_s41467-019-13225-y
Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/
DOI: 10.1038/s41467-019-13225-y
Access Statistics for this article
Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie
More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().