Groundwater Fluoride Prediction for Sustainable Water Management: A Comparative Evaluation of Machine Learning Approaches Enhanced by Satellite Embeddings
Yunbo Wei,
Rongfu Zhong and
Yun Yang
Additional contact information
Yunbo Wei: School of Earth Sciences and Engineering, Hohai University, Nanjing 211100, China
Rongfu Zhong: Zhejiang Environmental Technology Co., Ltd., Hangzhou 311000, China
Yun Yang: School of Earth Sciences and Engineering, Hohai University, Nanjing 211100, China
Sustainability, 2025, vol. 17, issue 18, 1-16
Abstract:
Groundwater fluoride contamination threatens water resources and public health, yet conventional water quality analysis is time-consuming and costly, which makes large-scale, sustainable monitoring difficult. Machine learning methods offer a promising, cost-effective alternative for assessing the spatial distribution of fluoride. This study developed and compared Random Forest (RF), Support Vector Machine (SVM), and Artificial Neural Network (ANN) models for predicting groundwater fluoride contamination in the Datong Basin, using satellite embeddings from the AlphaEarth Foundations model as additional predictors. Data from 391 groundwater sampling points were used, partitioned into training (80%) and testing (20%) sets. Features were selected by their ANOVA F-values, which identified surface elevation, pollution, population, evaporation, vertical distance to the rivers, distance to the Sanggan River, and nine additional bands from the satellite embeddings as the most relevant input variables. Model performance was evaluated using the confusion matrix and the area under the receiver operating characteristic curve (ROC-AUC). The SVM model achieved the highest ROC-AUC (0.82), outperforming the RF (0.80) and ANN (multilayer perceptron, MLP; 0.77) models. Adding the satellite embeddings improved all three models, reducing prediction errors by 13.8% to 23.3%. The SVM model enhanced by satellite embeddings proved to be a robust and reliable tool for predicting groundwater fluoride contamination, highlighting its potential for use in sustainable groundwater management.
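The workflow described in the abstract (80/20 split, ANOVA F-value feature selection, three classifiers, confusion matrix and ROC-AUC) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses scikit-learn, replaces the Datong Basin data with synthetic data, and assumes a binary target (fluoride above or below some threshold, which the abstract does not specify). The 70-column feature matrix (6 environmental variables plus 64 embedding bands), the stratified split, the feature scaling, and all hyperparameters are illustrative choices; only the sample count (391), the split ratio, and the number of selected features (6 + 9 = 15) come from the abstract.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.metrics import confusion_matrix, roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for the 391 sampling points (assumption: 70 candidate
# features, binary label for "fluoride exceeds threshold").
X, y = make_classification(
    n_samples=391, n_features=70, n_informative=10, random_state=0
)

# 80% training / 20% testing, as in the abstract. Stratification is an
# assumption added here to keep class proportions similar in both sets.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Hyperparameters below are placeholders, not values from the paper.
models = {
    "RF": RandomForestClassifier(n_estimators=200, random_state=0),
    "SVM": SVC(kernel="rbf", probability=True, random_state=0),
    "ANN": MLPClassifier(hidden_layer_sizes=(32,), max_iter=1000, random_state=0),
}

results = {}
for name, clf in models.items():
    # Scale, keep the 15 features with the highest ANOVA F-value, then fit.
    # Selection sits inside the pipeline so it only sees training data.
    pipe = make_pipeline(StandardScaler(), SelectKBest(f_classif, k=15), clf)
    pipe.fit(X_train, y_train)
    proba = pipe.predict_proba(X_test)[:, 1]
    results[name] = {
        "auc": roc_auc_score(y_test, proba),
        "cm": confusion_matrix(y_test, pipe.predict(X_test)),
    }

for name, res in results.items():
    print(name, "ROC-AUC:", round(res["auc"], 3))
    print(res["cm"])
```

Placing the feature selection step inside the pipeline keeps the F-values from being computed on the test set, which would otherwise leak information into the evaluation. The scores printed for the synthetic data have no relation to the values reported in the abstract.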
Keywords: Random Forest; Artificial Neural Network; Support Vector Machine; AlphaEarth
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56
Date: 2025
Downloads:
https://www.mdpi.com/2071-1050/17/18/8505/pdf (application/pdf)
https://www.mdpi.com/2071-1050/17/18/8505/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:17:y:2025:i:18:p:8505-:d:1755180
Sustainability is currently edited by Ms. Alexandra Wu
Bibliographic data for series maintained by MDPI Indexing Manager.