EconPapers    
Economics at your fingertips  
 

Groundwater Quality Assessment Based on the Random Forest Water Quality Index—Taking Karamay City as an Example

Yanna Xiong, Tianyi Zhang, Xi Sun, Wenchao Yuan, Mingjun Gao, Jin Wu () and Zhijun Han ()
Additional contact information
Yanna Xiong: Technical Centre for Soil, Agricultural and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China
Tianyi Zhang: Faculty of Architecture, Civil and Transportation Engineering, Beijing University of Technology, Beijing 100124, China
Xi Sun: School of Civil and Architectural Engineering, Anyang Institute of Technology, Anyang 455000, China
Wenchao Yuan: Technical Centre for Soil, Agricultural and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China
Mingjun Gao: Technical Centre for Soil, Agricultural and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China
Jin Wu: Faculty of Architecture, Civil and Transportation Engineering, Beijing University of Technology, Beijing 100124, China
Zhijun Han: Sino-Japan Friendship Centre for Environmental Protection, Beijing 100012, China

Sustainability, 2023, vol. 15, issue 19, 1-18

Abstract: In the past few decades, global industrial development and population growth have led to a scarcity of water resources, making sustainable management of groundwater a global challenge. The Water Quality Index (WQI) serves as a comprehensive method for assessing water quality and can provide valuable recommendations at the water quality level, optimizing policies for groundwater management. However, the subjectivity and uncertainty of the traditional WQI have negative impacts on evaluation outcomes, particularly in determining indicator weights and selecting aggregation functions. The proposed water quality index for groundwater based on the random forest (RFWQI) model in this study addresses these issues. It selects water quality indicators based on the actual pollution situation in the study area, employs an advanced random forest model to rank water quality indicators, determines indicator weights using the rank centroid method, scores the indicators using a sub-index function designed for groundwater development, and compares the results of two commonly used aggregation functions to identify the optimal one. Based on the aggregated scores, the water quality at 137 monitoring sites is classified into five levels: “Excellent”, “Good”, “Medium”, “Poor”, or “Unacceptable”. Among the 11 water quality indicators (sodium, sulfate, chloride, bicarbonate, total dissolved solids, fluoride, boron, nitrate, pH, COD Mn , and hardness), chloride was given the highest weight (0.236), followed by total dissolved solids (0.156), and sodium was given the lowest weight (0.008). The random forest model exhibits a good prediction capability before hyperparameter tuning (86% accuracy, RMSE of 0.378), and after grid search and five-fold cross-validation, the optimal hyperparameter combination is determined, further improving the performance of the random forest model (94% accuracy, F1-Score of 0.967, AUC of 0.91, RMSE of 0.232). For the newly developed groundwater sub-index function, interpolation is used to score each indicator, and after comparing two aggregation functions, the NSF aggregation function is selected as the most suitable for groundwater assessment. Overall, most of the groundwater in the study area was of poor quality (52.5% of low quality) and not suitable for drinking.

Keywords: water quality index; random forest; groundwater; aggregation function; machine learning (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2023
References: View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.mdpi.com/2071-1050/15/19/14477/pdf (application/pdf)
https://www.mdpi.com/2071-1050/15/19/14477/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:15:y:2023:i:19:p:14477-:d:1253601

Access Statistics for this article

Sustainability is currently edited by Ms. Alexandra Wu

More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jsusta:v:15:y:2023:i:19:p:14477-:d:1253601