EconPapers    
Economics at your fingertips  
 

Improving Water Quality Index Prediction Using Regression Learning Models

Jesmeen Mohd Zebaral Hoque, Nor Azlina Ab. Aziz (), Salem Alelyani, Mohamed Mohana and Maruf Hosain
Additional contact information
Jesmeen Mohd Zebaral Hoque: Faculty of Engineering & Technology, Multimedia University, Melaka 75450, Malaysia
Nor Azlina Ab. Aziz: Faculty of Engineering & Technology, Multimedia University, Melaka 75450, Malaysia
Salem Alelyani: Center for Artificial Intelligence (CAI), King Khalid University, Abha 61421, Saudi Arabia
Mohamed Mohana: Center for Artificial Intelligence (CAI), King Khalid University, Abha 61421, Saudi Arabia
Maruf Hosain: Faculty of Engineering & Technology, Multimedia University, Melaka 75450, Malaysia

IJERPH, 2022, vol. 19, issue 20, 1-23

Abstract: Rivers are the main sources of freshwater supply for the world population. However, many economic activities contribute to river water pollution. River water quality can be monitored using various parameters, such as the pH level, dissolved oxygen, total suspended solids, and the chemical properties. Analyzing the trend and pattern of these parameters enables the prediction of the water quality so that proactive measures can be made by relevant authorities to prevent water pollution and predict the effectiveness of water restoration measures. Machine learning regression algorithms can be applied for this purpose. Here, eight machine learning regression techniques, including decision tree regression, linear regression, ridge, Lasso, support vector regression, random forest regression, extra tree regression, and the artificial neural network, are applied for the purpose of water quality index prediction. Historical data from Indian rivers are adopted for this study. The data refer to six water parameters. Twelve other features are then derived from the original six parameters. The performances of the models using different algorithms and sets of features are compared. The derived water quality rating scale features are identified to contribute toward the development of better regression models, while the linear regression and ridge offer the best performance. The best mean square error achieved is 0 and the correlation coefficient is 1.

Keywords: water quality index; regression; linear regression; ridge (search for similar items in EconPapers)
JEL-codes: I I1 I3 Q Q5 (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/1660-4601/19/20/13702/pdf (application/pdf)
https://www.mdpi.com/1660-4601/19/20/13702/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jijerp:v:19:y:2022:i:20:p:13702-:d:949916

Access Statistics for this article

IJERPH is currently edited by Ms. Jenna Liu

More articles in IJERPH from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jijerp:v:19:y:2022:i:20:p:13702-:d:949916