GPS Pipeline: portable, scalable genomic pipeline for Streptococcus pneumoniae surveillance from Global Pneumococcal Sequencing Project
Harry C. H. Hung (),
Narender Kumar,
Victoria Dyster,
Corin Yeats,
Benjamin Metcalf,
Yuan Li,
Paulina A. Hawkins,
Lesley McGee,
Stephen D. Bentley and
Stephanie W. Lo ()
Additional contact information
Harry C. H. Hung: Wellcome Sanger Institute
Narender Kumar: Wellcome Sanger Institute
Victoria Dyster: Wellcome Sanger Institute
Corin Yeats: University of Oxford
Benjamin Metcalf: Centers for Disease Control and Prevention
Yuan Li: Centers for Disease Control and Prevention
Paulina A. Hawkins: Centers for Disease Control and Prevention
Lesley McGee: Centers for Disease Control and Prevention
Stephen D. Bentley: Wellcome Sanger Institute
Stephanie W. Lo: Wellcome Sanger Institute
Nature Communications, 2025, vol. 16, issue 1, 1-12
Abstract:
Abstract Streptococcus pneumoniae (pneumococcus) is a major pathogen globally, responsible for an estimated one million deaths annually and contributing significantly to the global burden of antimicrobial resistance. Ongoing surveillance of its vaccine antigen (i.e. serotypes), antimicrobial resistance, and pneumococcal lineages is crucial for assessing the impact of vaccination programs and guiding future vaccine design. However, current bioinformatics tools have several limitations that prevent them from enabling comprehensive analysis that allows simultaneous, large-scale, and independent generation of these crucial data. Here, we present the GPS Pipeline that enables reliable extraction of public health information from pneumococcal genomes using in silico methods. It can accurately identify 102 of 107 known serotypes, recognise 1053 pneumococcal lineages, and predict susceptibilities to 19 common antibiotics. Built on Nextflow and utilising containerisation technology, the GPS Pipeline minimises software setup requirements and bioinformatics expertise while facilitating large-scale analysis of genomic data. The GPS Pipeline was applied and validated on 20,924 pneumococcal genomes worldwide, demonstrating its effectiveness in enhancing responsiveness in pneumococcal genomic surveillance.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.nature.com/articles/s41467-025-64018-5 Abstract (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-64018-5
Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/
DOI: 10.1038/s41467-025-64018-5
Access Statistics for this article
Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie
More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().