Benchmarking of analysis strategies for data-independent acquisition proteomics using a large-scale dataset comprising inter-patient heterogeneity

Fröhlich, Klemens; Brombacher, Eva; Fahrner, Matthias; Vogele, Daniel; Kook, Lucas; Pinter, Niko; Bronsert, Peter; Timme-Bronsert, Sylvia; Schmidt, Alexander; Bärenfaller, Katja; Kreutz, Clemens; Schilling, Oliver

Benchmarking of analysis strategies for data-independent acquisition proteomics using a large-scale dataset comprising inter-patient heterogeneity

Klemens Fröhlich, Eva Brombacher, Matthias Fahrner, Daniel Vogele, Lucas Kook, Niko Pinter, Peter Bronsert, Sylvia Timme-Bronsert, Alexander Schmidt, Katja Bärenfaller, Clemens Kreutz and Oliver Schilling ()
Additional contact information
Klemens Fröhlich: Medical Center—University of Freiburg, Faculty of Medicine, University of Freiburg
Eva Brombacher: University of Freiburg
Matthias Fahrner: Medical Center—University of Freiburg, Faculty of Medicine, University of Freiburg
Daniel Vogele: Medical Center—University of Freiburg, Faculty of Medicine, University of Freiburg
Lucas Kook: University of Zurich
Niko Pinter: Medical Center—University of Freiburg, Faculty of Medicine, University of Freiburg
Peter Bronsert: Medical Center—University of Freiburg, Faculty of Medicine, University of Freiburg
Sylvia Timme-Bronsert: Medical Center—University of Freiburg, Faculty of Medicine, University of Freiburg
Alexander Schmidt: University of Basel
Katja Bärenfaller: University of Zurich, and Swiss Institute of Bioinformatics (SIB)
Clemens Kreutz: Faculty of Medicine and Medical Center – University of Freiburg
Oliver Schilling: Medical Center—University of Freiburg, Faculty of Medicine, University of Freiburg

Nature Communications, 2022, vol. 13, issue 1, 1-13

Abstract: Abstract Numerous software tools exist for data-independent acquisition (DIA) analysis of clinical samples, necessitating their comprehensive benchmarking. We present a benchmark dataset comprising real-world inter-patient heterogeneity, which we use for in-depth benchmarking of DIA data analysis workflows for clinical settings. Combining spectral libraries, DIA software, sparsity reduction, normalization, and statistical tests results in 1428 distinct data analysis workflows, which we evaluate based on their ability to correctly identify differentially abundant proteins. From our dataset, we derive bootstrap datasets of varying sample sizes and use the whole range of bootstrap datasets to robustly evaluate each workflow. We find that all DIA software suites benefit from using a gas-phase fractionated spectral library, irrespective of the library refinement used. Gas-phase fractionation-based libraries perform best against two out of three reference protein lists. Among all investigated statistical tests non-parametric permutation-based statistical tests consistently perform best.

Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.nature.com/articles/s41467-022-30094-0 Abstract (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-30094-0

Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/

DOI: 10.1038/s41467-022-30094-0

Access Statistics for this article

Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie

More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().