NutritionVerse3D2D: Large 3D Object and 2D Image Food Dataset for Dietary Intake Estimation
Chi-en Amy Tai (),
Matthew Keller,
Saeejith Nair,
Yuhao Chen,
Yifan Wu,
Olivia Markham,
Krish Parmar,
Pengcheng Xi and
Alexander Wong
Additional contact information
Chi-en Amy Tai: Department of Systems Design Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada
Matthew Keller: Department of Systems Design Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada
Saeejith Nair: Department of Systems Design Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada
Yuhao Chen: Department of Systems Design Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada
Yifan Wu: Department of Systems Design Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada
Olivia Markham: Department of Systems Design Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada
Krish Parmar: Department of Systems Design Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada
Pengcheng Xi: National Research Council Canada, Ottawa, ON K1A 0R6, Canada
Alexander Wong: Department of Systems Design Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada
Data, 2025, vol. 10, issue 11, 1-22
Abstract:
Elderly populations often face significant challenges when it comes to dietary intake tracking, often exacerbated by health complications. Unfortunately, conventional diet assessment techniques such as food frequency questionnaires, food diaries, and 24 h recall are subject to substantial bias. Recent advancements in machine learning and computer vision show promise of automated nutrition tracking methods of food, but require a large, high-quality dataset in order to accurately identify the nutrients from the food on the plate. However, manual creation of large-scale datasets with such diversity is time-consuming and hard to scale. On the other hand, synthesized 3D food models enable view augmentation to generate countless photorealistic 2D renderings from any viewpoint, reducing imbalance across camera angles. In this paper, we present a process to collect a large image dataset of food scenes that span diverse viewpoints and highlight its usage in dietary intake estimation. We first collect quality 3D objects of food items (NV-3D) that are used to generate photorealistic synthetic 2D food images (NV-Synth) and then manually collect a validation 2D food image dataset (NV-Real). We benchmark various intake estimation approaches on these datasets and present NutritionVerse3D2D, a collection of datasets that contain 3D objects and 2D images, along with models that estimate intake from the 2D food images. We release all the datasets along with the developed models to accelerate machine learning research on dietary sensing.
Keywords: datasets; dietary; segmentation; prediction (search for similar items in EconPapers)
JEL-codes: C8 C80 C81 C82 C83 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2306-5729/10/11/180/pdf (application/pdf)
https://www.mdpi.com/2306-5729/10/11/180/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jdataj:v:10:y:2025:i:11:p:180-:d:1787132
Access Statistics for this article
Data is currently edited by Ms. Becky Zhang
More articles in Data from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().