WQI Improvement Based on XG-BOOST Algorithm and Exploration of Optimal Indicator Set
Jing Liu,
Qi Chu,
Wenchao Yuan,
Dasheng Zhang and
Weifeng Yue
Additional contact information
Jing Liu: College of Architecture and Civil Engineering, Beijing University of Technology, Beijing 100124, China
Qi Chu: College of Architecture and Civil Engineering, Beijing University of Technology, Beijing 100124, China
Wenchao Yuan: Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China
Dasheng Zhang: Hebei Institute of Water Resources, Shijiazhuang 050051, China
Weifeng Yue: College of Water Sciences, Beijing Normal University, Beijing 100875, China
Sustainability, 2024, vol. 16, issue 24, 1-24
Abstract:
Taking part of the Manas River Basin in Xinjiang Province, China, as a case study, this paper proposes an improvement to the traditional comprehensive water quality index (WQI) method based on Extreme Gradient Boosting (XG-BOOST) and uses it to assess groundwater quality levels in the region. XG-BOOST is also used to screen the existing dataset of ten water quality indicators, namely fluoride (F⁻), chloride (Cl⁻), nitrate (NO₃⁻), sulfate (SO₄²⁻), silver (Ag), aluminum (Al), iron (Fe), lead (Pb), selenium (Se), and zinc (Zn), collected from 246 monitoring points, in order to find the indicator subset that gives the best model training performance. The results show that most of the selected study area falls into the “GOOD” and “POOR” water quality categories, with “GOOD” covering 48.7% of the area and “POOR” covering 31.6%. Regions classified as “UNFIT” are mainly distributed in the central-eastern part of the study area, within the Changji Hui Autonomous Prefecture. Water quality in the western part of the study area is better than in the eastern part, and areas with “EXCELLENT” water quality are found primarily in the south. The optimal water quality indicator set consists of five indicators, Cl⁻, NO₃⁻, Pb, Se, and Zn, with which the model achieves an accuracy of 98%, an RMSE of 0.1414, and R² = 0.9081.
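The reported metrics can be related to one another with a short calculation. The sketch below is illustrative only: the class labels are synthetic, not the paper's data, and the helper names `rmse` and `r_squared` are hypothetical. It assumes that water quality classes are encoded as consecutive integers, which the abstract does not state. Under that assumption, a model that is 98% accurate and whose errors are each off by exactly one class has an RMSE of sqrt(0.02), about 0.1414, which matches the reported value. This is one possible reading, not a confirmed description of the authors' evaluation.

```python
import math


def rmse(y_true, y_pred):
    """Root-mean-square error between two equal-length, non-empty sequences."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(y_true))


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when y_true is constant")
    return 1.0 - ss_res / ss_tot


# Synthetic integer class labels (ASSUMPTION: not the paper's data or encoding).
y_true = [1] * 50 + [2] * 30 + [3] * 20
y_pred = list(y_true)
y_pred[0] = 2    # true class 1 predicted as 2 (off by one class)
y_pred[60] = 3   # true class 2 predicted as 3 (off by one class)

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
error = rmse(y_true, y_pred)  # sqrt(2 / 100) = sqrt(0.02), about 0.1414
fit = r_squared(y_true, y_pred)

print(f"accuracy = {accuracy:.2f}, RMSE = {error:.4f}, R^2 = {fit:.4f}")
```

The R² value depends on how the classes are distributed in the test set, so the synthetic figure printed here should not be expected to reproduce the reported 0.9081.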
Keywords: WQI; groundwater; XG-BOOST; optimal indicator set screening
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56
Date: 2024
Downloads:
https://www.mdpi.com/2071-1050/16/24/10991/pdf (application/pdf)
https://www.mdpi.com/2071-1050/16/24/10991/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:16:y:2024:i:24:p:10991-:d:1543989
Sustainability is currently edited by Ms. Alexandra Wu
Bibliographic data for series maintained by MDPI Indexing Manager.