Benchmarking of cell type deconvolution pipelines for transcriptomics data
Francisco Avila Cobos (),
José Alquicira-Hernandez,
Joseph E. Powell,
Pieter Mestdagh and
Katleen De Preter ()
Additional contact information
Francisco Avila Cobos: Ghent University
José Alquicira-Hernandez: Garvan Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research
Joseph E. Powell: Garvan Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research
Pieter Mestdagh: Ghent University
Katleen De Preter: Ghent University
Nature Communications, 2020, vol. 11, issue 1, 1-14
Abstract:
Abstract Many computational methods have been developed to infer cell type proportions from bulk transcriptomics data. However, an evaluation of the impact of data transformation, pre-processing, marker selection, cell type composition and choice of methodology on the deconvolution results is still lacking. Using five single-cell RNA-sequencing (scRNA-seq) datasets, we generate pseudo-bulk mixtures to evaluate the combined impact of these factors. Both bulk deconvolution methodologies and those that use scRNA-seq data as reference perform best when applied to data in linear scale and the choice of normalization has a dramatic impact on some, but not all methods. Overall, methods that use scRNA-seq data have comparable performance to the best performing bulk methods whereas semi-supervised approaches show higher error values. Moreover, failure to include cell types in the reference that are present in a mixture leads to substantially worse results, regardless of the previous choices. Altogether, we evaluate the combined impact of factors affecting the deconvolution task across different datasets and propose general guidelines to maximize its performance.
Date: 2020
References: Add references at CitEc
Citations: View citations in EconPapers (11)
Downloads: (external link)
https://www.nature.com/articles/s41467-020-19015-1 Abstract (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:11:y:2020:i:1:d:10.1038_s41467-020-19015-1
Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/
DOI: 10.1038/s41467-020-19015-1
Access Statistics for this article
Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie
More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().