EconPapers    
Economics at your fingertips  
 

Solid Waste Analysis Using Open-Access Socio-Economic Data

Jürgen Dunkel, David Dominguez, Óscar G. Borzdynski and Ángel Sánchez
Additional contact information
Jürgen Dunkel: Computer Science Department, Hochschule Hannover, 30459 Hannover, Germany
David Dominguez: Computer Engineering Department, Universidad Autónoma de Madrid, 28049 Madrid, Spain
Óscar G. Borzdynski: Computer Engineering Department, Universidad Autónoma de Madrid, 28049 Madrid, Spain
Ángel Sánchez: Computer Science and Statistics Department, Universidad Rey Juan Carlos, 28933 Mostoles, Spain

Sustainability, 2022, vol. 14, issue 3, 1-24

Abstract: Nowadays, problems related with solid waste management become a challenge for most countries due to the rising generation of waste, related environmental issues, and associated costs of produced wastes. Effective waste management systems at different geographic levels require accurate forecasting of future waste generation. In this work, we investigate how open-access data, such as provided from the Organisation for Economic Co-operation and Development (OECD), can be used for the analysis of waste data. The main idea of this study is finding the links between socio-economic and demographic variables that determine the amounts of types of solid wastes produced by countries. This would make it possible to accurately predict at the country level the waste production and determine the requirements for the development of effective waste management strategies. In particular, we use several machine learning data regression (Support Vector, Gradient Boosting, and Random Forest) and clustering models ( k -means) to respectively predict waste production for OECD countries along years and also to perform clustering among these countries according to similar characteristics. The main contributions of our work are: (1) waste analysis at the OECD country-level to compare and cluster countries according to similar waste features predicted; (2) the detection of most relevant features for prediction models; and (3) the comparison between several regression models with respect to accuracy in predictions. Coefficient of determination ( R 2 ), Mean Absolute Error ( MAE ), Root Mean Square Error ( RMSE ), and Mean Absolute Percentage Error ( MAPE ), respectively, are used as indices of the efficiency of the developed models. Our experiments have shown that some data pre-processings on the OECD data are an essential stage required in the analysis; that Random Forest Regressor (RFR) produced the best prediction results over the dataset; and that these results are highly influenced by the quality of available socio-economic data. In particular, the RFR model exhibited the highest accuracy in predictions for most waste types. For example, for “municipal” waste, it produced, respectively, R 2 = 1 and M A P E = 4.31 global error values for the test set; and for “household” waste, it, respectively, produced R 2 = 1 and M A P E = 3.03 . Our results indicate that the considered models (and specially RFR) all are effective in predicting the amount of produced wastes derived from input data for the considered countries.

Keywords: solid waste management; OECD datasets; machine learning; forecasting models on countries; clustering on countries; smart cities (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2071-1050/14/3/1233/pdf (application/pdf)
https://www.mdpi.com/2071-1050/14/3/1233/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:14:y:2022:i:3:p:1233-:d:730461

Access Statistics for this article

Sustainability is currently edited by Ms. Alexandra Wu

More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jsusta:v:14:y:2022:i:3:p:1233-:d:730461