EconPapers    
Economics at your fingertips  
 

ClairS-TO: a deep-learning method for long-read tumor-only somatic small variant calling

Lei Chen, Zhenxian Zheng (), Junhao Su, Xian Yu, Angel On Ki Wong, Jingcheng Zhang, Yan-Lam Lee and Ruibang Luo ()
Additional contact information
Lei Chen: The University of Hong Kong
Zhenxian Zheng: The University of Hong Kong
Junhao Su: The University of Hong Kong
Xian Yu: The University of Hong Kong
Angel On Ki Wong: The University of Hong Kong
Jingcheng Zhang: The University of Hong Kong
Yan-Lam Lee: The University of Hong Kong
Ruibang Luo: The University of Hong Kong

Nature Communications, 2025, vol. 16, issue 1, 1-15

Abstract: Abstract Accurate detection of somatic variants in tumors is of critical importance and remains challenging. Current methods typically require matched normal samples for reliable detection, which are often unavailable in real-world research and clinical scenarios. Without a matched normal sample, more proficient algorithms are required to distinguish true somatic variants from germline variants and technical artifacts. However, existing tumor-only somatic variant callers that were designed for short-read sequencing data are not able to work well with long-read data. To fill the gap, we present ClairS-TO, a deep-learning-based method for long-read tumor-only somatic variant calling. ClairS-TO uses an ensemble of two disparate neural networks trained from the same samples but for opposite tasks—how likely/not likely a candidate is a somatic variant. Benchmarks using COLO829 and HCC1395 cancer cell lines show that ClairS-TO outperforms DeepSomatic and smrest in ONT and PacBio long-read data. ClairS-TO is also applicable to short-read data and outperforms Mutect2, Octopus, Pisces, and DeepSomatic. Extensive experiments across various sequencing coverages, variant allelic fractions, and tumor purities support that ClairS-TO is a reliable tool for somatic variant discovery. ClairS-TO is open-source, available at https://github.com/HKU-BAL/ClairS-TO .

Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://www.nature.com/articles/s41467-025-64547-z Abstract (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-64547-z

Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/

DOI: 10.1038/s41467-025-64547-z

Access Statistics for this article

Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie

More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-11-02
Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-64547-z