Machine learning on national shopping data reliably estimates childhood obesity prevalence and socio-economic deprivation
Gavin Long,
Georgiana Nica-Avram,
John Harvey,
Evgeniya Lukinova,
Roberto Mansilla,
Simon Welham,
Gregor Engelmann,
Elizabeth Dolan,
Kuzivakwashe Makokoro,
Michelle Thomas,
Edward Powell and
James Goulding
Food Policy, 2025, vol. 131, issue C
Abstract:
Deprivation pushes people to choose cheap, calorie-dense foods instead of nutritious but expensive alternatives. Diseases, such as obesity, cardiovascular disease, and diabetes, resulting from these poor dietary choices place a significant burden on public health systems. Measuring nutritional insecurity is difficult to achieve at scale and so the ability to study the relationship between nutritional outcomes and deprivation at a national level is very challenging. This makes it difficult to understand the effect of new policies or track changes over time. To address this challenge, we develop a machine learning approach using massive anonymised transactional data (4 million members and 2.5 billion transactions) in partnership with the retailer The Co-operative Group UK. We engineer a series of variables related to obesogenic diets, including a new measure called ‘Calorie-oriented purchasing’. These variables help illustrate how large-scale transactional data can discriminate between neighbourhoods most affected by deprivation and childhood obesity. Through comparative assessment of machine learning approaches, we find better performance from tree-based models (Random Forest, XGBoost) with the best-achieving accuracy of 0.88 for predicting deprivation and an accuracy of 0.79 for childhood obesity. Calorie-oriented purchasing emerges as a robust predictor of deprivation and childhood obesity at the census area level. Results show this approach can help summarise nutritional insecurity, and support its spatio-temporal monitoring. We conclude with policy implications and recommend retailers adopt new measures for measuring national nutrition insecurity.
Keywords: Deprivation; Obesity; Machine learning; Dietary Monitoring; Digital Footprints; Food Security (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0306919225000302
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:jfpoli:v:131:y:2025:i:c:s0306919225000302
DOI: 10.1016/j.foodpol.2025.102826
Access Statistics for this article
Food Policy is currently edited by J. Kydd
More articles in Food Policy from Elsevier
Bibliographic data for series maintained by Catherine Liu ().