WQI Improvement Based on XG-BOOST Algorithm and Exploration of Optimal Indicator Set

Jing Liu, Qi Chu, Wenchao Yuan, Dasheng Zhang and Weifeng Yue
Additional contact information
Jing Liu: College of Architecture and Civil Engineering, Beijing University of Technology, Beijing 100124, China
Qi Chu: College of Architecture and Civil Engineering, Beijing University of Technology, Beijing 100124, China
Wenchao Yuan: Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China
Dasheng Zhang: Hebei Institute of Water Resources, Shijiazhuang 050051, China
Weifeng Yue: College of Water Sciences, Beijing Normal University, Beijing 100875, China

Sustainability, 2024, vol. 16, issue 24, 1-24

Abstract: Using a portion of the Manas River Basin in Xinjiang, China, as a case study, this paper improves the traditional comprehensive water quality index (WQI) method with Extreme Gradient Boosting (XG-BOOST) and applies it to assess groundwater quality levels in the region. XG-BOOST is also used to screen the existing dataset of ten water quality indicators, namely fluoride (F), chlorine (Cl), nitrate (NO₃), sulfate (SO₄), silver (Ag), aluminum (Al), iron (Fe), lead (Pb), selenium (Se), and zinc (Zn), collected from 246 monitoring points, in order to find the indicator subset that optimizes model training performance. The results show that “GOOD” and “POOR” water quality account for most of the selected study area, covering 48.7% and 31.6% of it, respectively. Regions classified as “UNFIT” are mainly distributed in the central–eastern part of the study area, within parts of the Changji Hui Autonomous Prefecture. Water quality in the western part of the study area is better than in the eastern part, and areas with “EXCELLENT” water quality are primarily distributed in the south. The optimal water quality indicator dataset consists of five indicators, Cl, NO₃, Pb, Se, and Zn, and achieves an accuracy of 98%, RMSE = 0.1414, and R² = 0.9081.
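
The three metrics quoted at the end of the abstract can be illustrated with a minimal pure-Python sketch. Everything in it beyond the reported values is an assumption: the abstract does not state the test-set size, how the water quality classes are encoded, or how the authors computed the metrics, so the 100-point test set and the integer class codes 0 to 3 below are hypothetical. The sketch only shows that an accuracy of 98% with every misclassification off by exactly one class gives RMSE = sqrt(0.02) ≈ 0.1414, which is one scenario consistent with the reported figures.

```python
from math import sqrt

def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true class code exactly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root mean squared error between true and predicted class codes."""
    n = len(y_true)
    return sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)

def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# Hypothetical test set (NOT from the paper): 100 points with
# water quality classes coded as the integers 0..3.
y_true = [0, 1, 2, 3] * 25
y_pred = list(y_true)
y_pred[0] += 1  # first misclassification, off by one class
y_pred[1] += 1  # second misclassification, off by one class

print(accuracy(y_true, y_pred))        # 0.98
print(round(rmse(y_true, y_pred), 4))  # 0.1414
```

R² depends on how the true classes are distributed across the test set, so this sketch does not attempt to reproduce the reported value of 0.9081.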

Keywords: WQI; groundwater; XG-BOOST; optimal indicator set screening
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56
Date: 2024
Downloads: (external link)
https://www.mdpi.com/2071-1050/16/24/10991/pdf (application/pdf)
https://www.mdpi.com/2071-1050/16/24/10991/ (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:16:y:2024:i:24:p:10991-:d:1543989

Sustainability is currently edited by Ms. Alexandra Wu

More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager.

 
Page updated 2025-03-19
Handle: RePEc:gam:jsusta:v:16:y:2024:i:24:p:10991-:d:1543989