EconPapers    
Economics at your fingertips  
 

A Unified Bayesian Framework for Bi-overlapping-Clustering Multi-omics Data via Sparse Matrix Factorization

Fangting Zhou, Kejun He (), James J. Cai, Laurie A. Davidson, Robert S. Chapkin and Yang Ni ()
Additional contact information
Fangting Zhou: Renmin University of China
Kejun He: Renmin University of China
James J. Cai: Texas A&M University
Laurie A. Davidson: Texas A&M University
Robert S. Chapkin: Texas A&M University
Yang Ni: Texas A&M University

Statistics in Biosciences, 2023, vol. 15, issue 3, No 8, 669-691

Abstract: Abstract The advances of modern sequencing techniques have generated an unprecedented amount of multi-omics data which provide great opportunities to quantitatively explore functional genomes from different but complementary perspectives. However, distinct modalities/sequencing technologies generate diverse types of data which greatly complicate statistical modeling because uniquely optimized methods are required for handling each type of data. In this paper, we propose a unified framework for Bayesian nonparametric matrix factorization that infers overlapping bi-clusters for multi-omics data. The proposed method adaptively discretizes different types of observations into common latent states on which cluster structures are built hierarchically. The proposed Bayesian nonparametric method is able to automatically determine the number of clusters. We demonstrate the utility of the proposed method using simulation studies and applications to a single-cell RNA-sequencing dataset, a combination of single-cell RNA-sequencing and single-cell ATAC-sequencing dataset, a bulk RNA-sequencing dataset, and a DNA methylation dataset which reveal several interesting findings that are consistent with biological literature.

Keywords: Bayesian nonparametric prior; Data integration; Indian buffet process; Mixture model; Single-cell sequencing (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s12561-022-09350-w Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:stabio:v:15:y:2023:i:3:d:10.1007_s12561-022-09350-w

Ordering information: This journal article can be ordered from
http://www.springer.com/journal/12561

DOI: 10.1007/s12561-022-09350-w

Access Statistics for this article

Statistics in Biosciences is currently edited by Hongyu Zhao and Xihong Lin

More articles in Statistics in Biosciences from Springer, International Chinese Statistical Association
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-04-26
Handle: RePEc:spr:stabio:v:15:y:2023:i:3:d:10.1007_s12561-022-09350-w