EconPapers    
Economics at your fingertips  
 

INSurVeyor: improving insertion calling from short read sequencing data

Ramesh Rajaby, Dong-Xu Liu, Chun Hang Au, Yuen-Ting Cheung, Amy Yuet Ting Lau, Qing-Yong Yang and Wing-Kin Sung ()
Additional contact information
Ramesh Rajaby: Hong Kong Science Park
Dong-Xu Liu: College of Informatics, Huazhong Agricultural University
Chun Hang Au: Hong Kong Science Park
Yuen-Ting Cheung: Hong Kong Science Park
Amy Yuet Ting Lau: Hong Kong Science Park
Qing-Yong Yang: College of Informatics, Huazhong Agricultural University
Wing-Kin Sung: Hong Kong Science Park

Nature Communications, 2023, vol. 14, issue 1, 1-13

Abstract: Abstract Insertions are one of the major types of structural variations and are defined as the addition of 50 nucleotides or more into a DNA sequence. Several methods exist to detect insertions from next-generation sequencing short read data, but they generally have low sensitivity. Our contribution is two-fold. First, we introduce INSurVeyor, a fast, sensitive and precise method that detects insertions from next-generation sequencing paired-end data. Using publicly available benchmark datasets (both human and non-human), we show that INSurVeyor is not only more sensitive than any individual caller we tested, but also more sensitive than all of them combined. Furthermore, for most types of insertions, INSurVeyor is almost as sensitive as long reads callers. Second, we provide state-of-the-art catalogues of insertions for 1047 Arabidopsis Thaliana genomes from the 1001 Genomes Project and 3202 human genomes from the 1000 Genomes Project, both generated with INSurVeyor. We show that they are more complete and precise than existing resources, and important insertions are missed by existing methods.

Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.nature.com/articles/s41467-023-38870-2 Abstract (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-38870-2

Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/

DOI: 10.1038/s41467-023-38870-2

Access Statistics for this article

Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie

More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-19
Handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-38870-2