Multi-class random forest model to classify wastewater treatment imbalanced data
Veronica Distefano,
Monica Palma and
Sandra De Iaco
Socio-Economic Planning Sciences, 2024, vol. 95, issue C
Abstract:
The odor emissions generated by treatment plants imply complex environmental and economic issues. The modern instrumental odor monitoring systems, based on an array of several sensors, continuously record the gaseous compounds. However they are characterized by poor selectivity, compromising the possibility to discriminate and identify the emission sources. In this paper, the ability of odor sensors to distinguish between the treatment plant sections generating the gaseous compounds is evaluated on the basis of the random forest classifier, and is also compared to the discriminant analysis performance. Taking into account that a multi-parametric system of sensors can be affected by the presence of a small sample size with imbalanced classes, several strategies for data balancing are proposed and analyzed. The findings show that the random forest classifier is characterized by a better capacity to distinguish the emissions sources with respect to the classical multiple discriminant analysis, in terms of all evaluation metrics. This is also confirmed for different resampling techniques, especially in the over-sampling case. The data concerning measurements from 10 sensors of multi-parametric systems of odor monitoring collected from a company specialized in environmental assistance are considered for this analysis.
Keywords: Multi-classification; Data imbalance; Resampling approach; Treatment plant sections; Electronic nose; Machine learning (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0038012124002209
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:soceps:v:95:y:2024:i:c:s0038012124002209
DOI: 10.1016/j.seps.2024.102021
Access Statistics for this article
Socio-Economic Planning Sciences is currently edited by Barnett R. Parker
More articles in Socio-Economic Planning Sciences from Elsevier
Bibliographic data for series maintained by Catherine Liu ().