Predicting COVID-19 Spread Level using Socio-Economic Indicators and Machine Learning Techniques
Alaeddine Mihoub (),
Hosni Snoun (),
Moez Krichen (),
Montassar Kahia () and
Riadh Bel Hadj Salah
Additional contact information
Moez Krichen: REDCAD - Unité de Recherche en développement et contrôle d'applications distribuées - ENIS - École Nationale d'Ingénieurs de Sfax | National School of Engineers of Sfax
Post-Print from HAL
Abstract:
The new so-called COVID-19 virus is unfortunately founded to be highly transmissible across the globe. In this study, we propose a novel approach for estimating the spread level of the virus for each country for three different dates between April and May 2020. Unlike previous studies, this investigation does not process any historical data of spread but rather relies on the socioeconomic indicators of each country. Actually, more than 1000 socioeconomic indicators and more than 190 countries were processed in this study. Concretely, data preprocessing techniques and feature selection approaches were applied to extract relevant indicators for the classification process. Countries around the globe were assigned to 4 classes of spread. To find the class level of each country, many classifiers were proposed based especially on Support Vectors Machines (SVM), Multi-Layer Perceptrons (MLP) and Random Forests (RF). Obtained results show the relevance of our approach since many classifiers succeeded in capturing the spread level, especially the RF classifier, with an F-measure equal to 93.85% for April 15th, 2020. Moreover, a feature importance study is conducted to deduce the best indicators to build robust spread level classifiers. However, as pointed out in the discussion, classifiers may face some difficulties for future dates since the huge increase of cases and the lack of other relevant factors affecting this widespread.
Keywords: covid-19; socio-economic indicators; data preprocessing; spread level prediction; machine learning; country classification; coronavirus; SARS-CoV-2; feature importance (search for similar items in EconPapers)
Date: 2020-11-03
New Economics Papers: this item is included in nep-big and nep-cmp
Note: View the original document on HAL open archive server: https://hal.science/hal-03002886
References: View references in EconPapers View complete reference list from CitEc
Citations:
Published in SMARTTECH 2020 - The First International Conference of Smart Systems and Emerging Technologies, Nov 2020, Riyadh, Saudi Arabia. ⟨10.1109/SMART-TECH49988.2020.00041⟩
Downloads: (external link)
https://hal.science/hal-03002886/document (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hal:journl:hal-03002886
DOI: 10.1109/SMART-TECH49988.2020.00041
Access Statistics for this paper
More papers in Post-Print from HAL
Bibliographic data for series maintained by CCSD ().