Shapley Value as a Quality Control for Mass Spectra of Human Glioblastoma Tissues
Denis S. Zavorotnyuk,
Anatoly A. Sorokin,
Stanislav I. Pekov,
Denis S. Bormotov,
Vasiliy A. Eliferov,
Konstantin V. Bocharov,
Eugene N. Nikolaev and
Igor A. Popov
Additional contact information
Denis S. Zavorotnyuk: The Moscow Institute of Physics and Technology, National Research University, 141701 Dolgoprudny, Russia
Anatoly A. Sorokin: The Moscow Institute of Physics and Technology, National Research University, 141701 Dolgoprudny, Russia
Stanislav I. Pekov: Skolkovo Institute of Science and Technology, 121205 Moscow, Russia
Denis S. Bormotov: The Moscow Institute of Physics and Technology, National Research University, 141701 Dolgoprudny, Russia
Vasiliy A. Eliferov: The Moscow Institute of Physics and Technology, National Research University, 141701 Dolgoprudny, Russia
Konstantin V. Bocharov: V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
Eugene N. Nikolaev: Skolkovo Institute of Science and Technology, 121205 Moscow, Russia
Igor A. Popov: The Moscow Institute of Physics and Technology, National Research University, 141701 Dolgoprudny, Russia
Data, 2023, vol. 8, issue 1, 1-9
Abstract:
The automatic processing of high-dimensional mass spectrometry data is required for the clinical implementation of ambient ionization molecular profiling methods. However, the complex algorithms required for the analysis of peak-rich spectra are sensitive to the quality of the input data. Therefore, an objective and quantitative indicator, insensitive to the conditions of the experiment, is in high demand for the automated treatment of mass spectrometric data. In this work, we demonstrate the utility of the Shapley value as an indicator of the quality of an individual mass spectrum in a classification task for human brain tumor tissue discrimination. The Shapley values are calculated on a training set of spectra from glioblastoma and nontumor pathological tissues and used as feedback to create a random forest regression model that estimates the contributions of all spectra of each specimen. As a result, we show that the use of Shapley values significantly accelerates the analysis of negative mode mass spectrometry data while simultaneously improving the accuracy of the regression models.
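As an illustration of the idea in the abstract, the following minimal sketch computes the Shapley value of each training spectrum as its average marginal contribution to classification accuracy. Everything specific here is an assumption made for illustration and is not taken from the paper: the two-feature toy "spectra", the labels, the nearest-centroid classifier used as the utility function, and the function names. The random forest regression step that the authors train on the resulting values is not reproduced. Exhaustive enumeration of permutations is only feasible for a handful of spectra; a real data set would need a sampling-based approximation.

```python
from itertools import permutations
from math import factorial


def nearest_centroid_accuracy(train, test):
    """Utility of a training subset: accuracy of a nearest-centroid
    classifier on `test`. An empty subset has utility 0.0.
    (Hypothetical stand-in for the classifier used in the paper.)"""
    if not train:
        return 0.0
    sums, counts = {}, {}
    for x, y in train:
        if y not in sums:
            sums[y] = [0.0] * len(x)
            counts[y] = 0
        sums[y] = [s + v for s, v in zip(sums[y], x)]
        counts[y] += 1
    centroids = {y: [s / counts[y] for s in sums[y]] for y in sums}

    def squared_distance(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))

    correct = 0
    for x, y in test:
        predicted = min(centroids, key=lambda c: squared_distance(x, centroids[c]))
        correct += predicted == y
    return correct / len(test)


def exact_shapley(train, test, utility):
    """Exact Shapley value of every training item: the marginal gain in
    utility when the item is added, averaged over all orderings."""
    n = len(train)
    values = [0.0] * n
    for order in permutations(range(n)):
        subset = []
        previous = utility(subset, test)
        for i in order:
            subset.append(train[i])
            current = utility(subset, test)
            values[i] += current - previous
            previous = current
    return [v / factorial(n) for v in values]


# Toy data (assumed): two intensity features per spectrum.
train = [
    ([1.0, 0.0], "tumor"),
    ([0.9, 0.1], "tumor"),
    ([0.0, 1.0], "nontumor"),
    ([0.9, 0.2], "nontumor"),  # ambiguous spectrum close to the tumor group
]
test = [
    ([0.8, 0.1], "tumor"),
    ([0.1, 0.9], "nontumor"),
    ([0.95, 0.05], "tumor"),
    ([0.2, 0.8], "nontumor"),
]

values = exact_shapley(train, test, nearest_centroid_accuracy)
for (features, label), value in zip(train, values):
    print(label, features, round(value, 4))
```

By construction the values sum to the utility of the full training set minus the utility of the empty set, so each one can be read as that spectrum's share of the final accuracy. Spectra with low or negative shares are candidates for exclusion as low quality.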
Keywords: ambient ionization mass spectrometry; Shapley value; classification
JEL-codes: C8 C80 C81 C82 C83
Date: 2023
Downloads: (external link)
https://www.mdpi.com/2306-5729/8/1/21/pdf (application/pdf)
https://www.mdpi.com/2306-5729/8/1/21/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jdataj:v:8:y:2023:i:1:p:21-:d:1037357
Data is currently edited by Ms. Cecilia Yang
Bibliographic data for series maintained by MDPI Indexing Manager.