EconPapers    
Economics at your fingertips  
 

The Prediction of Diabetes

Alessandro Massaro, Nicola Magaletti, Gabriele Cosoli, Vito O. M. Giardinelli and Angelo Leogrande

MPRA Paper from University Library of Munich, Germany

Abstract: The following article presents an analysis of the determinants of diabetes using a dataset containing the surveys of 2000 patients from the Frankfurt Hospital in Germany. The data were analyzed using the following models, namely: Tobit, Probit, Logit, Multinomial Logit, OLS, WLS with heteroskedasticity. The results show that the presence of diabetes is positively associated with "Pregnancies", "Glucose", "BMI", "Diabetes Pedigree Function", "Age" and negatively associated with "Blood Pressure". A cluster analysis is realized using the fuzzy c-Means algorithm optimized with the Elbow method and three clusters were found. Finally a confrontation among eight different machine learning algorithms is realized to select the best performing algorithm to predict the probability of patients to develop diabetes.

Keywords: Machine Learning; Clusterization; Elbow Method; Prediction; Correlation Matrix; Principal Component Analysis; Binary and non-Binary regression models. (search for similar items in EconPapers)
JEL-codes: I10 I11 I12 I13 I14 I15 I18 (search for similar items in EconPapers)
Date: 2022-06-13
New Economics Papers: this item is included in nep-big, nep-for and nep-hea
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://mpra.ub.uni-muenchen.de/113372/1/MPRA_paper_113372.pdf original version (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:pra:mprapa:113372

Access Statistics for this paper

More papers in MPRA Paper from University Library of Munich, Germany Ludwigstraße 33, D-80539 Munich, Germany. Contact information at EDIRC.
Bibliographic data for series maintained by Joachim Winter ().

 
Page updated 2025-03-22
Handle: RePEc:pra:mprapa:113372