An Adaptive Graph Convolutional Network with Spatial Autocorrelation for Enhancing 3D Soil Pollutant Mapping Precision from Sparse Borehole Data
Huan Tao,
Ziyang Li (),
Shengdong Nie,
Hengkai Li and
Dan Zhao ()
Additional contact information
Huan Tao: Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences (CAS), Beijing 100101, China
Ziyang Li: School of Information Science and Technology, Beijing Forestry University, Beijing 100083, China
Shengdong Nie: Civil and Surveying & Mapping Engineering, Jiangxi University of Science and Technology, Ganzhou 341000, China
Hengkai Li: Civil and Surveying & Mapping Engineering, Jiangxi University of Science and Technology, Ganzhou 341000, China
Dan Zhao: Key Laboratory of Environmental Damage Identification and Restoration, Ministry of Ecology and Environment, Beijing 100041, China
Land, 2025, vol. 14, issue 7, 1-15
Abstract:
Sparse borehole sampling at contaminated sites results in sparse and unevenly distributed data on soil pollutants. Traditional interpolation methods may obscure local variations in soil contamination when applied to such sparse data, thus reducing the interpolation accuracy. We propose an adaptive graph convolutional network with spatial autocorrelation (ASI-GCN) model to overcome this challenge. The ASI-GCN model effectively constrains pollutant concentration transfer while capturing subtle spatial variations, improving soil pollution characterization accuracy. We tested our model at a coking plant using 215 soil samples from 15 boreholes, evaluating its robustness with three pollutants of varying volatility: arsenic (As, non-volatile), benzo(a)pyrene (BaP, semi-volatile), and benzene (Ben, volatile). Leave-one-out cross-validation demonstrates that the ASI-GCN_RC_G model (ASI-GCN with residual connections) achieves the highest prediction accuracy. Specifically, the R for As, BaP, and Ben are 0.728, 0.825, and 0.781, respectively, outperforming traditional models by 58.8% (vs. IDW), 45.82% (vs. OK), and 53.78% (vs. IDW). Meanwhile, their RMSE drop by 36.56% (vs. Bayesian_K), 38.02% (vs. Bayesian_K), and 35.96% (vs. IDW), further confirming the model’s superior precision. Beyond accuracy, Monte Carlo uncertainty analysis reveals that most predicted areas exhibit low uncertainty, with only a few high-pollution hotspots exhibiting relatively high uncertainty. Further analysis revealed the significant influence of pollutant volatility on vertical migration patterns. Non-volatile As was primarily distributed in the fill and silty sand layers, and semi-volatile BaP concentrated in the silty sand layer. At the same time, volatile Ben was predominantly found in the clay and fine sand layers. By integrating spatial autocorrelation with deep graph representation, ASI-GCN redefines sparse data 3D mapping, offering a transformative tool for precise environmental governance and human health assessment.
Keywords: soil pollution; graph neural network; sparse samples; 3D spatial interpolation; contaminated site (search for similar items in EconPapers)
JEL-codes: Q15 Q2 Q24 Q28 Q5 R14 R52 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2073-445X/14/7/1348/pdf (application/pdf)
https://www.mdpi.com/2073-445X/14/7/1348/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jlands:v:14:y:2025:i:7:p:1348-:d:1687134
Access Statistics for this article
Land is currently edited by Ms. Carol Ma
More articles in Land from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().