EconPapers    
Economics at your fingertips  
 

P-NUT: Predicting NUTrient Content from Short Text Descriptions

Gordana Ispirova, Tome Eftimov and Barbara Koroušić Seljak
Additional contact information
Gordana Ispirova: Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia
Tome Eftimov: Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia
Barbara Koroušić Seljak: Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia

Mathematics, 2020, vol. 8, issue 10, 1-21

Abstract: Assessing nutritional content is very relevant for patients suffering from various diseases, professional athletes, and for health reasons is becoming part of everyday life for many. However, it is a very challenging task as it requires complete and reliable sources. We introduce a machine learning pipeline for predicting macronutrient values of foods using learned vector representations from short text descriptions of food products. On a dataset used from health specialists, containing short descriptions of foods and macronutrient values: we generate paragraph embeddings, introduce clustering in food groups, using graph-based vector representations, that include food domain knowledge information, and train regression models for each cluster. The predictions are for four macronutrients: carbohydrates, fat, protein and water. The highest accuracy was obtained for carbohydrate predictions – 86%, compared to the baseline – 27% and 36%. The protein predictions yielded the best results across all clusters, 53%–77% of the values fall in the tolerance-level range. These results were obtained using short descriptions, the embeddings can be improved if they are learned on longer descriptions, which would lead to better prediction results. Since the task of calculating macronutrients requires exact quantities of ingredients, these results obtained only from short description are a huge leap forward.

Keywords: macronutrient prediction; representation learning; machine learning; data mining; word embeddings; paragraph embeddings; single-target regression (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/8/10/1811/pdf (application/pdf)
https://www.mdpi.com/2227-7390/8/10/1811/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:8:y:2020:i:10:p:1811-:d:429336

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:8:y:2020:i:10:p:1811-:d:429336