WEPP: Phylogenetic placement achieves near-haplotype resolution in wastewater-based epidemiology
Pranav Gangwar,
Pratik Katte,
Manu Bhat and
Yatish Turakhia
PLOS Computational Biology, 2026, vol. 22, issue 3, 1-29
Abstract:
Wastewater-based epidemiology (WBE) is a cost-effective, unbiased, and time-efficient tool for public health surveillance. Although widely adopted since the COVID-19 pandemic, WBE remains underutilized in genomic epidemiology, as most tools are limited to lineage-level resolution and focus only on estimating lineage abundances from wastewater sequencing reads. Here, we present WEPP, a pathogen-agnostic pipeline that improves both the resolution and capabilities of WBE analysis. WEPP uses phylogenetic placement of sequencing reads onto mutation-annotated trees (MATs)—daily updated phylogenies of all globally available clinical sequences and their inferred ancestors—to sensitively and precisely identify a subset of haplotypes likely present in a sample. It also reports the abundance of each haplotype and lineage, and flags “unaccounted alleles”— those found in the sample but not explained by selected haplotypes—that may indicate novel variants. WEPP includes a powerful interactive dashboard for high-resolution visual analysis, allowing users to explore haplotype and lineage abundances, read-to-haplotype mappings, and unaccounted alleles within a global phylogenetic context. Applied to wastewater samples from multiple cities and pathogens, WEPP uncovered biological insights sometimes missed by other tools and enabled new WBE applications previously confined to clinical sequencing, such as identifying (i) intra-lineage haplotype clusters, (ii) multiple cluster introductions in a city, (iii) early haplotype detection up to five weeks before clinical confirmation, (iv) mutations from novel variants, and (v) circulating lineages missed by clinical surveillance. With these capabilities, WEPP can transform wastewater-based epidemiology into a more powerful tool for monitoring and managing infectious disease outbreaks.Author summary: Since the COVID-19 pandemic, wastewater-based epidemiology (WBE)—which analyzes sewage samples to detect traces of pathogens circulating in a community—has been widely adopted worldwide for public health surveillance. While more cost- and time-efficient than clinical testing, WBE is complicated by the fact that sewage contains only small fragments from a wide mix of pathogen variants infecting the community. Consequently, existing WBE tools have been limited to coarse resolution at the strain or lineage level. Our tool, WEPP, addresses this challenge by harnessing the vast global repository of clinical genome sequences to identify near-exact matches in wastewater. This enhanced resolution enables powerful new WBE applications, such as detecting emerging undesignated variants or tracking the global spread of locally circulating variants using a comprehensive database—capabilities previously confined to clinical surveillance.
Date: 2026
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1014124 (text/html)
https://journals.plos.org/ploscompbiol/article/fil ... 14124&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pcbi00:1014124
DOI: 10.1371/journal.pcbi.1014124
Access Statistics for this article
More articles in PLOS Computational Biology from Public Library of Science
Bibliographic data for series maintained by ploscompbiol ().