Machine Learning Prediction Model of Tuberculosis Incidence Based on Meteorological Factors and Air Pollutants
Na Tang,
Maoxiang Yuan,
Zhijun Chen,
Jian Ma,
Rui Sun,
Yide Yang,
Quanyuan He,
Xiaowei Guo,
Shixiong Hu () and
Junhua Zhou ()
Additional contact information
Na Tang: The Key Laboratory of Model Animals and Stem Cell Biology in Hunan Province, School of Medicine, Hunan Normal University, Changsha 410013, China
Maoxiang Yuan: Changde Center for Disease Control and Prevention, Changde 415000, China
Zhijun Chen: The Key Laboratory of Model Animals and Stem Cell Biology in Hunan Province, School of Medicine, Hunan Normal University, Changsha 410013, China
Jian Ma: The Key Laboratory of Model Animals and Stem Cell Biology in Hunan Province, School of Medicine, Hunan Normal University, Changsha 410013, China
Rui Sun: The Key Laboratory of Model Animals and Stem Cell Biology in Hunan Province, School of Medicine, Hunan Normal University, Changsha 410013, China
Yide Yang: The Key Laboratory of Model Animals and Stem Cell Biology in Hunan Province, School of Medicine, Hunan Normal University, Changsha 410013, China
Quanyuan He: The Key Laboratory of Model Animals and Stem Cell Biology in Hunan Province, School of Medicine, Hunan Normal University, Changsha 410013, China
Xiaowei Guo: The Key Laboratory of Model Animals and Stem Cell Biology in Hunan Province, School of Medicine, Hunan Normal University, Changsha 410013, China
Shixiong Hu: Hunan Provincial Center for Disease Control and Prevention, Changsha 410005, China
Junhua Zhou: The Key Laboratory of Model Animals and Stem Cell Biology in Hunan Province, School of Medicine, Hunan Normal University, Changsha 410013, China
IJERPH, 2023, vol. 20, issue 5, 1-17
Abstract:
Background: Tuberculosis (TB) is a public health problem worldwide, and the influence of meteorological and air pollutants on the incidence of tuberculosis have been attracting interest from researchers. It is of great importance to use machine learning to build a prediction model of tuberculosis incidence influenced by meteorological and air pollutants for timely and applicable measures of both prevention and control. Methods: The data of daily TB notifications, meteorological factors and air pollutants in Changde City, Hunan Province ranging from 2010 to 2021 were collected. Spearman rank correlation analysis was conducted to analyze the correlation between the daily TB notifications and the meteorological factors or air pollutants. Based on the correlation analysis results, machine learning methods, including support vector regression, random forest regression and a BP neural network model, were utilized to construct the incidence prediction model of tuberculosis. RMSE, MAE and MAPE were performed to evaluate the constructed model for selecting the best prediction model. Results: (1) From the year 2010 to 2021, the overall incidence of tuberculosis in Changde City showed a downward trend. (2) The daily TB notifications was positively correlated with average temperature (r = 0.231), maximum temperature (r = 0.194), minimum temperature (r = 0.165), sunshine duration (r = 0.329), PM 2.5 (r = 0.097), PM 10 (r = 0.215) and O 3 (r = 0.084) ( p < 0.05). However, there was a significant negative correlation between the daily TB notifications and mean air pressure (r = −0.119), precipitation (r = −0.063), relative humidity (r = −0.084), CO (r = −0.038) and SO 2 (r = −0.034) ( p < 0.05). (3) The random forest regression model had the best fitting effect, while the BP neural network model exhibited the best prediction. (4) The validation set of the BP neural network model, including average daily temperature, sunshine hours and PM 10 , showed the lowest root mean square error, mean absolute error and mean absolute percentage error, followed by support vector regression. Conclusions: The prediction trend of the BP neural network model, including average daily temperature, sunshine hours and PM 10 , successfully mimics the actual incidence, and the peak incidence highly coincides with the actual aggregation time, with a high accuracy and a minimum error. Taken together, these data suggest that the BP neural network model can predict the incidence trend of tuberculosis in Changde City.
Keywords: tuberculosis; machine learning; forecasting; neural networks; random forest regression; support vector regression (search for similar items in EconPapers)
JEL-codes: I I1 I3 Q Q5 (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/1660-4601/20/5/3910/pdf (application/pdf)
https://www.mdpi.com/1660-4601/20/5/3910/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jijerp:v:20:y:2023:i:5:p:3910-:d:1076934
Access Statistics for this article
IJERPH is currently edited by Ms. Jenna Liu
More articles in IJERPH from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().