EconPapers    
Economics at your fingertips  
 

GPS Pipeline: portable, scalable genomic pipeline for Streptococcus pneumoniae surveillance from Global Pneumococcal Sequencing Project

Harry C. H. Hung (), Narender Kumar, Victoria Dyster, Corin Yeats, Benjamin Metcalf, Yuan Li, Paulina A. Hawkins, Lesley McGee, Stephen D. Bentley and Stephanie W. Lo ()
Additional contact information
Harry C. H. Hung: Wellcome Sanger Institute
Narender Kumar: Wellcome Sanger Institute
Victoria Dyster: Wellcome Sanger Institute
Corin Yeats: University of Oxford
Benjamin Metcalf: Centers for Disease Control and Prevention
Yuan Li: Centers for Disease Control and Prevention
Paulina A. Hawkins: Centers for Disease Control and Prevention
Lesley McGee: Centers for Disease Control and Prevention
Stephen D. Bentley: Wellcome Sanger Institute
Stephanie W. Lo: Wellcome Sanger Institute

Nature Communications, 2025, vol. 16, issue 1, 1-12

Abstract: Abstract Streptococcus pneumoniae (pneumococcus) is a major pathogen globally, responsible for an estimated one million deaths annually and contributing significantly to the global burden of antimicrobial resistance. Ongoing surveillance of its vaccine antigen (i.e. serotypes), antimicrobial resistance, and pneumococcal lineages is crucial for assessing the impact of vaccination programs and guiding future vaccine design. However, current bioinformatics tools have several limitations that prevent them from enabling comprehensive analysis that allows simultaneous, large-scale, and independent generation of these crucial data. Here, we present the GPS Pipeline that enables reliable extraction of public health information from pneumococcal genomes using in silico methods. It can accurately identify 102 of 107 known serotypes, recognise 1053 pneumococcal lineages, and predict susceptibilities to 19 common antibiotics. Built on Nextflow and utilising containerisation technology, the GPS Pipeline minimises software setup requirements and bioinformatics expertise while facilitating large-scale analysis of genomic data. The GPS Pipeline was applied and validated on 20,924 pneumococcal genomes worldwide, demonstrating its effectiveness in enhancing responsiveness in pneumococcal genomic surveillance.

Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://www.nature.com/articles/s41467-025-64018-5 Abstract (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-64018-5

Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/

DOI: 10.1038/s41467-025-64018-5

Access Statistics for this article

Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie

More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-09-26
Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-64018-5