Correcting for sparsity and interdependence in glycomics by accounting for glycan biosynthesis
Bokan Bao,
Benjamin P. Kellman,
Austin W. T. Chiang,
Yujie Zhang,
James T. Sorrentino,
Austin K. York,
Mahmoud A. Mohammad,
Morey W. Haymond,
Lars Bode and
Nathan E. Lewis ()
Additional contact information
Bokan Bao: University of California, San Diego
Benjamin P. Kellman: University of California, San Diego
Austin W. T. Chiang: University of California, San Diego
Yujie Zhang: University of California, San Diego
James T. Sorrentino: University of California, San Diego
Austin K. York: University of California, San Diego
Mahmoud A. Mohammad: Baylor College of Medicine
Morey W. Haymond: Baylor College of Medicine
Lars Bode: University of California, San Diego
Nathan E. Lewis: University of California, San Diego
Nature Communications, 2021, vol. 12, issue 1, 1-14
Abstract:
Abstract Glycans are fundamental cellular building blocks, involved in many organismal functions. Advances in glycomics are elucidating the essential roles of glycans. Still, it remains challenging to properly analyze large glycomics datasets, since the abundance of each glycan is dependent on many other glycans that share many intermediate biosynthetic steps. Furthermore, the overlap of measured glycans can be low across samples. We address these challenges with GlyCompare, a glycomic data analysis approach that accounts for shared biosynthetic steps for all measured glycans to correct for sparsity and non-independence in glycomics, which enables direct comparison of different glycoprofiles and increases statistical power. Using GlyCompare, we study diverse N-glycan profiles from glycoengineered erythropoietin. We obtain biologically meaningful clustering of mutant cell glycoprofiles and identify knockout-specific effects of fucosyltransferase mutants on tetra-antennary structures. We further analyze human milk oligosaccharide profiles and find mother’s fucosyltransferase-dependent secretor-status indirectly impact the sialylation. Finally, we apply our method on mucin-type O-glycans, gangliosides, and site-specific compositional glycosylation data to reveal tissues and disease-specific glycan presentations. Our substructure-oriented approach will enable researchers to take full advantage of the growing power and size of glycomics data.
Date: 2021
References: Add references at CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
https://www.nature.com/articles/s41467-021-25183-5 Abstract (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-021-25183-5
Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/
DOI: 10.1038/s41467-021-25183-5
Access Statistics for this article
Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie
More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().