Smell compounds classification using UMAP to increase knowledge of odors and molecular structures linkages
Marylène Rugard,
Thomas Jaylet,
Olivier Taboureau,
Anne Tromelin and
Karine Audouze
PLOS ONE, 2021, vol. 16, issue 5, 1-17
Abstract:
This study aims to highlight the relationships between the structure of smell compounds and their odors. For this purpose, heterogeneous data sources were screened, and 6038 odorant compounds and their known associated odors (162 odor notes) were compiled, each individual molecule being represented with a set of 1024 structural fingerprint. Several dimensional reduction techniques (PCA, MDS, t-SNE and UMAP) with two clustering methods (k-means and agglomerative hierarchical clustering AHC) were assessed based on the calculated fingerprints. The combination of UMAP with k-means and AHC methods allowed to obtain a good representativeness of odors by clusters, as well as the best visualization of the proximity of odorants on the basis of their molecular structures. The presence or absence of molecular substructures has been calculated on odorant in order to link chemical groups to odors. The results of this analysis bring out some associations for both the odor notes and the chemical structures of the molecules such as “woody” and “spicy” notes with allylic and bicyclic structures, “balsamic” notes with unsaturated rings, both “sulfurous” and “citrus” with aldehydes, alcohols, carboxylic acids, amines and sulfur compounds, and “oily”, “fatty” and “fruity” characterized by esters and with long carbon chains. Overall, the use of UMAP associated to clustering is a promising method to suggest hypotheses on the odorant structure-odor relationships.
Date: 2021
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0252486 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 52486&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0252486
DOI: 10.1371/journal.pone.0252486
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().