EconPapers    
Economics at your fingertips  
 

Respiratory Diseases, Malaria and Leishmaniasis: Temporal and Spatial Association with Fire Occurrences from Knowledge Discovery and Data Mining

Lucas Schroeder, Mauricio Roberto Veronez, Eniuce Menezes de Souza, Diego Brum, Luiz Gonzaga and Vinicius Francisco Rofatto
Additional contact information
Lucas Schroeder: X|Reality and Geoinformatics Lab., Vale do Rio dos Sinos University, São Leopoldo 93022-750, Brazil
Mauricio Roberto Veronez: X|Reality and Geoinformatics Lab., Vale do Rio dos Sinos University, São Leopoldo 93022-750, Brazil
Eniuce Menezes de Souza: Department of Statistics, State University of Maringá, Maringá 87020-900, Brazil
Diego Brum: X|Reality and Geoinformatics Lab., Vale do Rio dos Sinos University, São Leopoldo 93022-750, Brazil
Luiz Gonzaga: X|Reality and Geoinformatics Lab., Vale do Rio dos Sinos University, São Leopoldo 93022-750, Brazil
Vinicius Francisco Rofatto: Department of Geography, Federal University of Uberlândia, Uberlândia 38408-100, Brazil

IJERPH, 2020, vol. 17, issue 10, 1-23

Abstract: The relationship between the fires occurrences and diseases is an essential issue for making public health policy and environment protecting strategy. Thanks to the Internet, today, we have a huge amount of health data and fire occurrence reports at our disposal. The challenge, therefore, is how to deal with 4 Vs (volume, variety, velocity and veracity) associated with these data. To overcome this problem, in this paper, we propose a method that combines techniques based on Data Mining and Knowledge Discovery from Databases (KDD) to discover spatial and temporal association between diseases and the fire occurrences. Here, the case study was addressed to Malaria, Leishmaniasis and respiratory diseases in Brazil. Instead of losing a lot of time verifying the consistency of the database, the proposed method uses Decision Tree, a machine learning-based supervised classification, to perform a fast management and extract only relevant and strategic information, with the knowledge of how reliable the database is. Namely, States, Biomes and period of the year (months) with the highest rate of fires could be identified with great success rates and in few seconds. Then, the K-means, an unsupervised learning algorithms that solves the well-known clustering problem, is employed to identify the groups of cities where the fire occurrences is more expressive. Finally, the steps associated with KDD is perfomed to extract useful information from mined data. In that case, Spearman’s rank correlation coefficient, a nonparametric measure of rank correlation, is computed to infer the statistical dependence between fire occurrences and those diseases. Moreover, maps are also generated to represent the distribution of the mined data. From the results, it was possible to identify that each region showed a susceptible behaviour to some disease as well as some degree of correlation with fire outbreak, mainly in the drought period.

Keywords: health; fire; big data; Data Mining; Knowledge Discovery from Databases; machine learning (search for similar items in EconPapers)
JEL-codes: I I1 I3 Q Q5 (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/1660-4601/17/10/3718/pdf (application/pdf)
https://www.mdpi.com/1660-4601/17/10/3718/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jijerp:v:17:y:2020:i:10:p:3718-:d:362544

Access Statistics for this article

IJERPH is currently edited by Ms. Jenna Liu

More articles in IJERPH from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jijerp:v:17:y:2020:i:10:p:3718-:d:362544