Facilitate integrated analysis of single cell multiomic data by binarizing gene expression values
Rohan Misra,
Alexander Ferrena and
Deyou Zheng ()
Additional contact information
Rohan Misra: Albert Einstein College of Medicine
Alexander Ferrena: Albert Einstein College of Medicine
Deyou Zheng: Albert Einstein College of Medicine
Nature Communications, 2025, vol. 16, issue 1, 1-12
Abstract:
Abstract A cell type’s identity can be revealed by its transcriptome and epigenome profiles, both of which can be in flux temporally and spatially, leading to distinct cell states or subtypes. The popular and standard workflow for single cell RNA-seq (scRNA-seq) data analysis applies feature selection, dimensional reduction, and clustering on the gene expression values quantified by read counts, but alternative approaches using a simple classification of a gene to “on” and “off” (i.e., binarization of the gene expression) have been proposed for clustering cells and other downstream analyses. Here, we demonstrate that a direct concatenation of the binarized scRNA-seq data and the standard single cell ATAC-seq data is sufficient and effective for vertical integrated clustering analysis, after applying term-frequency-inverse document frequency (TF-IDF) and single value decomposition (also called latent semantic indexing, LSI) algorithms to the combined data, when the two data modalities are collected using a paired multiomic technology. This proposed approach avoids the need for converting scATAC-seq data to gene activity scores for combined analysis. Furthermore it enables a direct investigation into the contribution of each data type for resolving cell type or subtype identity.
Date: 2025
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.nature.com/articles/s41467-025-60899-8 Abstract (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-60899-8
Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/
DOI: 10.1038/s41467-025-60899-8
Access Statistics for this article
Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie
More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().