Supervised Machine Learning-Based Models for Predicting Raised Blood Sugar

Owess, Marwa Mustafa; Owda, Amani Yousef; Owda, Majdi; Massad, Salwa

Supervised Machine Learning-Based Models for Predicting Raised Blood Sugar

Marwa Mustafa Owess, Amani Yousef Owda (), Majdi Owda and Salwa Massad
Additional contact information
Marwa Mustafa Owess: Department of Natural, Engineering, and Technology Sciences, Arab American University, Ramallah P600, Palestine
Amani Yousef Owda: Department of Natural, Engineering, and Technology Sciences, Arab American University, Ramallah P600, Palestine
Majdi Owda: Faculty of Data Science, UNESCO Chair in Data Science for Sustainable Development, Arab American University, Ramallah P600, Palestine
Salwa Massad: The World Health Organization, Jerusalem P.O. Box 54812, Palestine

IJERPH, 2024, vol. 21, issue 7, 1-24

Abstract: Raised blood sugar (hyperglycemia) is considered a strong indicator of prediabetes or diabetes mellitus. Diabetes mellitus is one of the most common non-communicable diseases (NCDs) affecting the adult population. Recently, the prevalence of diabetes has been increasing at a faster rate, especially in developing countries. The primary concern associated with diabetes is the potential for serious health complications to occur if it is not diagnosed early. Therefore, timely detection and screening of diabetes is considered a crucial factor in treating and controlling the disease. Population screening for raised blood sugar aims to identify individuals at risk before symptoms appear, enabling timely intervention and potentially improved health outcomes. However, implementing large-scale screening programs can be expensive, requiring testing, follow-up, and management resources, potentially straining healthcare systems. Given the above facts, this paper presents supervised machine-learning models to detect and predict raised blood sugar. The proposed raised blood sugar models utilize diabetes-related risk factors including age, body mass index (BMI), eating habits, physical activity, prevalence of other diseases, and fasting blood sugar obtained from the dataset of the STEPwise approach to NCD risk factor study collected from adults in the Palestinian community. The diabetes risk factor obtained from the STEPS dataset was used as input for building the prediction model that was trained using various types of supervised learning classification algorithms including random forest, decision tree, Adaboost, XGBoost, bagging decision trees, and multi-layer perceptron (MLP). Based on the experimental results, the raised blood sugar models demonstrated optimal performance when implemented with a random forest classifier, yielding an accuracy of 98.4%. Followed by the bagging decision trees, XGBoost, MLP, AdaBoost, and decision tree with an accuracy of 97.4%, 96.4%, 96.3%, 95.2%, and 94.8%, respectively.

Keywords: raised blood sugar; diabetes; machine learning; prediction; classification (search for similar items in EconPapers)
JEL-codes: I I1 I3 Q Q5 (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/1660-4601/21/7/840/pdf (application/pdf)
https://www.mdpi.com/1660-4601/21/7/840/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jijerp:v:21:y:2024:i:7:p:840-:d:1423613

Access Statistics for this article

IJERPH is currently edited by Ms. Jenna Liu

More articles in IJERPH from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().