EconPapers    
Economics at your fingertips  
 

PGAP2: A comprehensive toolkit for prokaryotic pan-genome analysis based on fine-grained feature networks

Congfan Bu, Hao Zhang, Fengnian Zhang, Wenhao Liang, Hao Gao, Jing Zhao, Fangming Lv, Ruikun Xue, Qian Liu, Zhewen Zhang, Zhong Jin () and Jingfa Xiao ()
Additional contact information
Congfan Bu: China National Center for Bioinformation
Hao Zhang: China National Center for Bioinformation
Fengnian Zhang: University of Chinese Academy of Sciences
Wenhao Liang: University of Chinese Academy of Sciences
Hao Gao: China National Center for Bioinformation
Jing Zhao: China National Center for Bioinformation
Fangming Lv: China National Center for Bioinformation
Ruikun Xue: China National Center for Bioinformation
Qian Liu: Chinese Academy of Sciences
Zhewen Zhang: China National Center for Bioinformation
Zhong Jin: Chinese Academy of Sciences
Jingfa Xiao: China National Center for Bioinformation

Nature Communications, 2025, vol. 16, issue 1, 1-13

Abstract: Abstract Pan-genome analysis is a crucial method for studying genomic dynamics. By creating pan-genome maps for prokaryotic organisms, we can gain valuable insights into their genetic diversity and ecological adaptability. However, current analytical methods often struggle to balance accuracy and computational efficiency, and they tend to provide primarily qualitative results. This study introduces PGAP2, an integrated software package that simplifies various processes, including data quality control, pan-genome analysis, and result visualization. PGAP2 facilitates the rapid and accurate identification of orthologous and paralogous genes by employing fine-grained feature analysis within constrained regions. Our systematic evaluation with simulated and gold-standard datasets demonstrates that PGAP2 is more precise, robust, and scalable than state-of-the-art tools for large-scale pan-genome data. Furthermore, PGAP2 introduces four quantitative parameters derived from the distances between or within clusters, enabling detailed characterization of homology clusters. Finally, we validate our quantitative findings by applying PGAP2 to construct a pan-genomic profile of 2794 zoonotic Streptococcus suis strains. This analysis offers new insights into the genetic diversity of S. suis, thereby enhancing our understanding of its genomic structure. PGAP2 is freely available at https://github.com/bucongfan/PGAP2 .

Date: 2025
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.nature.com/articles/s41467-025-64846-5 Abstract (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-64846-5

Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/

DOI: 10.1038/s41467-025-64846-5

Access Statistics for this article

Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie

More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-12-06
Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-64846-5