EconPapers    
Economics at your fingertips  
 

Application of Data Science for Cluster Analysis of COVID-19 Mortality According to Sociodemographic Factors at Municipal Level in Mexico

Joaquín Pérez-Ortega, Nelva Nely Almanza-Ortega, Kirvis Torres-Poveda, Gerardo Martínez-González, José Crispín Zavala-Díaz and Rodolfo Pazos-Rangel
Additional contact information
Joaquín Pérez-Ortega: Tecnológico Nacional de México/CENIDET, Cuernavaca 62490, Mexico
Nelva Nely Almanza-Ortega: Tecnológico Nacional de México/IT de Tlalnepantla, Tlalnepantla de Baz 54070, Mexico
Kirvis Torres-Poveda: Centro de Investigación Sobre Enfermedades Infecciosas, Instituto Nacional de Salud Pública, Cuernavaca 62100, Mexico
Gerardo Martínez-González: Tecnológico Nacional de México/CENIDET, Cuernavaca 62490, Mexico
José Crispín Zavala-Díaz: Administración e Informática, Facultad de Contaduría, Universidad Autónoma de Morelos, Cuernavaca 62209, Mexico
Rodolfo Pazos-Rangel: Tecnológico Nacional de México/IT de Cd. Madero, Madero 89440, Mexico

Mathematics, 2022, vol. 10, issue 13, 1-16

Abstract: Mexico is among the five countries with the largest number of reported deaths from COVID-19 disease, and the mortality rates associated to infections are heterogeneous in the country due to structural factors concerning population. This study aims at the analysis of clusters related to mortality rate from COVID-19 at the municipal level in Mexico from the perspective of Data Science. In this sense, a new application is presented that uses a machine learning hybrid algorithm for generating clusters of municipalities with similar values of sociodemographic indicators and mortality rates. To provide a systematic framework, we applied an extension of the International Business Machines Corporation (IBM) methodology called Batch Foundation Methodology for Data Science (FMDS). For the study, 1,086,743 death certificates corresponding to the year 2020 were used, among other official data. As a result of the analysis, two key indicators related to mortality from COVID-19 at the municipal level were identified: one is population density and the other is percentage of population in poverty. Based on these indicators, 16 municipality clusters were determined. Among the main results of this research, it was found that clusters with high values of mortality rate had high values of population density and low poverty levels. In contrast, clusters with low density values and high poverty levels had low mortality rates. Finally, we think that the patterns found, expressed as municipality clusters with similar characteristics, can be useful for decision making by health authorities regarding disease prevention and control for reinforcing public health measures and optimizing resource distribution for reducing hospitalizations and mortality.

Keywords: clustering; COVID-19; Data Science; Data Science methodology; epidemiology; machine learning; pandemic; unsupervised learning (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/10/13/2167/pdf (application/pdf)
https://www.mdpi.com/2227-7390/10/13/2167/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:10:y:2022:i:13:p:2167-:d:844746

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:10:y:2022:i:13:p:2167-:d:844746