Discovering organic reactions with a machine-learning-powered deciphering of tera-scale mass spectrometry data
Konstantin S. Kozlov,
Daniil A. Boiko,
Julia V. Burykina,
Valentina V. Ilyushenkova,
Alexander Y. Kostyukovich,
Ekaterina D. Patil and
Valentine P. Ananikov ()
Additional contact information
Konstantin S. Kozlov: Russian Academy of Sciences
Daniil A. Boiko: Russian Academy of Sciences
Julia V. Burykina: Russian Academy of Sciences
Valentina V. Ilyushenkova: Russian Academy of Sciences
Alexander Y. Kostyukovich: Russian Academy of Sciences
Ekaterina D. Patil: Russian Academy of Sciences
Valentine P. Ananikov: Russian Academy of Sciences
Nature Communications, 2025, vol. 16, issue 1, 1-12
Abstract:
Abstract The accumulation of large datasets by the scientific community has surpassed the capacity of traditional processing methods, underscoring the critical need for innovative and efficient algorithms capable of navigating through extensive existing experimental data. Addressing this challenge, our study introduces a machine learning (ML)-powered search engine specifically tailored for analyzing tera-scale high-resolution mass spectrometry (HRMS) data. This engine harnesses a novel isotope-distribution-centric search algorithm augmented by two synergistic ML models, assisting with the discovery of hitherto unknown chemical reactions. This methodology enables the rigorous investigation of existing data, thus providing efficient support for chemical hypotheses while reducing the need for conducting additional experiments. Moreover, we extend this approach with baseline methods for automated reaction hypothesis generation. In its practical validation, our approach successfully identified several reactions, unveiling previously undescribed transformations. Among these, the heterocycle-vinyl coupling process within the Mizoroki-Heck reaction stands out, highlighting the capability of the engine to elucidate complex chemical phenomena.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.nature.com/articles/s41467-025-56905-8 Abstract (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-56905-8
Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/
DOI: 10.1038/s41467-025-56905-8
Access Statistics for this article
Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie
More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().