EconPapers    
Economics at your fingertips  
 

Large protein databases reveal structural complementarity and functional locality

Paweł Szczerbiak, Lukasz M. Szydlowski, Witold Wydmański, P. Douglas Renfrew, Julia Koehler Leman and Tomasz Kosciolek ()
Additional contact information
Paweł Szczerbiak: Sano Centre for Computational Medicine
Lukasz M. Szydlowski: Sano Centre for Computational Medicine
Witold Wydmański: Jagiellonian University
P. Douglas Renfrew: Flatiron Institute
Julia Koehler Leman: Open Molecular Software Foundation
Tomasz Kosciolek: Sano Centre for Computational Medicine

Nature Communications, 2025, vol. 16, issue 1, 1-15

Abstract: Abstract Recent breakthroughs in protein structure prediction have led to a surge in high-quality 3D models, highlighting the need for efficient computational solutions. In our work, we examine the structural clusters from the AlphaFold Protein Structure Database (AFDB), a high-quality subset of ESMAtlas, and the Microbiome Immunity Project (MIP). We create a single cohesive low-dimensional representation of the resulting protein space. We show that, while each database occupies distinct regions, they collectively exhibit significant overlap in their functional profiles. High-level biological functions tend to cluster in particular regions, revealing a shared functional landscape despite the diverse sources of data. By creating a representation of protein structure space, localizing functional annotations within this space, and providing an open-access web-server for exploration, this work offers insights for future research concerning protein sequence-structure-function relationships, enabling biological questions to be asked about taxonomic assignments, environmental factors, or functional specificity. This approach is generalizable, thus enabling further discovery beyond findings presented here.

Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://www.nature.com/articles/s41467-025-63250-3 Abstract (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-63250-3

Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/

DOI: 10.1038/s41467-025-63250-3

Access Statistics for this article

Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie

More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-08-27
Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-63250-3