EconPapers    
Economics at your fingertips  
 

HiDDEN: a machine learning method for detection of disease-relevant populations in case-control single-cell transcriptomics data

Aleksandrina Goeva (), Michael-John Dolan, Judy Luu, Eric Garcia, Rebecca Boiarsky, Rajat M. Gupta and Evan Macosko ()
Additional contact information
Aleksandrina Goeva: Broad Institute of Massachusetts Institute of Technology and Harvard
Michael-John Dolan: Broad Institute of Massachusetts Institute of Technology and Harvard
Judy Luu: Broad Institute of Massachusetts Institute of Technology and Harvard
Eric Garcia: Broad Institute of Massachusetts Institute of Technology and Harvard
Rebecca Boiarsky: Broad Institute of Massachusetts Institute of Technology and Harvard
Rajat M. Gupta: Broad Institute of Massachusetts Institute of Technology and Harvard
Evan Macosko: Broad Institute of Massachusetts Institute of Technology and Harvard

Nature Communications, 2024, vol. 15, issue 1, 1-15

Abstract: Abstract In case-control single-cell RNA-seq studies, sample-level labels are transferred onto individual cells, labeling all case cells as affected, when in reality only a small fraction of them may actually be perturbed. Here, using simulations, we demonstrate that the standard approach to single cell analysis fails to isolate the subset of affected case cells and their markers when either the affected subset is small, or when the strength of the perturbation is mild. To address this fundamental limitation, we introduce HiDDEN, a computational method that refines the case-control labels to accurately reflect the perturbation status of each cell. We show HiDDEN’s superior ability to recover biological signals missed by the standard analysis workflow in simulated ground truth datasets of cell type mixtures. When applied to a dataset of human multiple myeloma precursor conditions, HiDDEN recapitulates the expert manual annotation and discovers malignancy in early stage samples missed in the original analysis. When applied to a mouse model of demyelination, HiDDEN identifies an endothelial subpopulation playing a role in early stage blood-brain barrier dysfunction. We anticipate that HiDDEN should find wide usage in contexts that require the detection of subtle transcriptional changes in cell types across conditions.

Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.nature.com/articles/s41467-024-53666-8 Abstract (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-53666-8

Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/

DOI: 10.1038/s41467-024-53666-8

Access Statistics for this article

Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie

More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-19
Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-53666-8