EconPapers    
Economics at your fingertips  
 

Detecting sample swaps in diverse NGS data types using linkage disequilibrium

Nauman Javed, Yossi Farjoun, Tim J. Fennell, Charles B. Epstein, Bradley E. Bernstein and Noam Shoresh ()
Additional contact information
Nauman Javed: Massachusetts General Hospital and Harvard Medical School
Yossi Farjoun: Broad Institute of MIT and Harvard
Tim J. Fennell: Broad Institute of MIT and Harvard
Charles B. Epstein: Broad Institute of MIT and Harvard
Bradley E. Bernstein: Massachusetts General Hospital and Harvard Medical School
Noam Shoresh: Broad Institute of MIT and Harvard

Nature Communications, 2020, vol. 11, issue 1, 1-8

Abstract: Abstract As the number of genomics datasets grows rapidly, sample mislabeling has become a high stakes issue. We present CrosscheckFingerprints (Crosscheck), a tool for quantifying sample-relatedness and detecting incorrectly paired sequencing datasets from different donors. Crosscheck outperforms similar methods and is effective even when data are sparse or from different assays. Application of Crosscheck to 8851 ENCODE ChIP-, RNA-, and DNase-seq datasets enabled us to identify and correct dozens of mislabeled samples and ambiguous metadata annotations, representing ~1% of ENCODE datasets.

Date: 2020
References: Add references at CitEc
Citations:

Downloads: (external link)
https://www.nature.com/articles/s41467-020-17453-5 Abstract (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:11:y:2020:i:1:d:10.1038_s41467-020-17453-5

Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/

DOI: 10.1038/s41467-020-17453-5

Access Statistics for this article

Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie

More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-19
Handle: RePEc:nat:natcom:v:11:y:2020:i:1:d:10.1038_s41467-020-17453-5