Modeling the optimal dosage of coagulants in water treatment plants using various machine learning models
Mohammed Achite (),
Saeed Farzin (),
Nehal Elshaboury (),
Mahdi Valikhan Anaraki (),
Mohammed Amamra () and
Abderrezak Kamel Toubal ()
Additional contact information
Mohammed Achite: Hassiba Benbouali University of Chlef
Saeed Farzin: Semnan University
Nehal Elshaboury: Construction and Project Management Research Institute
Mahdi Valikhan Anaraki: Semnan University
Mohammed Amamra: Hassiba Benbouali University of Chlef
Abderrezak Kamel Toubal: Hassiba Benbouali University of Chlef
Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, 2024, vol. 26, issue 2, No 25, 3395-3421
Abstract:
Abstract One of the main methods for determining coagulant dosage (CD) is the jar test. However, this method is expensive, time-consuming, and requires laboratory equipment. In this situation, machine learning, especially hybrid machine learning, is suitable to estimate the CD without requiring a jar test. To this end, a hybrid model based on a combination of the M5 and the gorilla troops optimizer (GTO) algorithm was introduced as the M5-GTO model. Nine different parameters, including raw water production (RWP), turbidity of water, Conductivity, TDS, Salinity, pH, water temperature (WT), SM, and O2, were also utilized as inputs in the CD modeling. The results of comparing the proposed model with the multiple linear regression, multiple nonlinear regression, artificial neural network, multivariate adaptive regression splines, M5 model tree, k-nearest neighbor, least-squares support vector machine, general regression neural network, and random forest (RF) showed that this model is more accurate in CD modeling. Besides, the M5-GTO well estimated the distribution of observational data. For example, the values of the mean absolute error, root mean square error (RMSE), relative RMSE, normalized RMSE, and correlation coefficient criteria for the M5-GTO were equal to 0.562, 1.172, 0.257, 0.170, and 0.967, which were up to 73%, 54.9%, 54.8%, 55%, and 4% are more accurate than LSSVM (worst algorithm), respectively. In addition, the M5 and RF algorithms were also in the second and third ranks. The partial dependence plots results showed that the RWP and WT had the most significant effects on CD changes. Increasing the RWP reduced the amount of CD, while increasing the WT increased the CD. The algorithm introduced in the present study has a high potential for modeling various parameters in water treatment plants.
Keywords: Water quality; Optimal coagulant dosage; Water treatment plant; Machine learning models; M5-GTO algorithm (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s10668-022-02835-0 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:endesu:v:26:y:2024:i:2:d:10.1007_s10668-022-02835-0
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/10668
DOI: 10.1007/s10668-022-02835-0
Access Statistics for this article
Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development is currently edited by Luc Hens
More articles in Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().