Application of Machine Learning Approaches to Predict Soil Element Background Concentration at Large Region Scale
Jiao Li,
Linglong Meng,
Tianran Li,
Pengli Xue,
Hejing Wang and
Jie Hua
Additional contact information
Jiao Li: Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China
Linglong Meng: Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China
Tianran Li: Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China
Pengli Xue: Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China
Hejing Wang: Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China
Jie Hua: Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China
Sustainability, 2025, vol. 17, issue 17, 1-22
Abstract:
Soil element background concentrations are foundational data for environmental quality assessment, contamination diagnosis, and sustainable land management. However, existing investigation-based methods are time-consuming and inefficient. Machine learning (ML) methods have demonstrated excellent performance in predicting soil heavy metal concentrations. In this study, nine environmental variables related to soil formation (elevation, pH, organic matter, soil type, parent material, plant cover, land use type, topography, and soil texture), recorded at 210 soil monitoring points, were used with decision tree (DT), random forest (RF), extreme gradient boosting (XGB), and support vector machine (SVM) models to predict the background concentrations of eleven soil elements. Among them, the SVM and RF models effectively predicted the background concentrations of all soil heavy metals. Compared with XGB and DT, the SVM performed best for all heavy metals except cadmium (Cd) and manganese (Mn). Although the key factors affecting background concentrations vary among soil elements, organic matter, soil type, and altitude play a crucial role in the accurate prediction of soil element background concentrations. This study provides simple and efficient ML models for predicting soil element background concentrations at a large regional scale. The results of this study can be utilized to distinguish natural geochemical processes from human-induced pollution.
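The model comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the data are synthetic, the variable ranges, hyperparameters, and the use of cross-validated R^2 as the score are assumptions, and scikit-learn's GradientBoostingRegressor is used as a stand-in for XGBoost so the example depends on a single library.

```python
import numpy as np
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor
from sklearn.model_selection import KFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler
from sklearn.svm import SVR
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
n = 210  # number of monitoring points stated in the abstract

# Synthetic stand-ins for the three numeric predictors (NOT real soil data;
# the ranges below are hypothetical).
numeric = np.column_stack([
    rng.uniform(0, 3000, n),   # elevation
    rng.uniform(4.5, 9.0, n),  # pH
    rng.uniform(5, 60, n),     # organic matter
])
# Six categorical predictors coded as integers: soil type, parent material,
# plant cover, land use type, topography, soil texture (4 made-up classes each).
categorical = rng.integers(0, 4, size=(n, 6))
X = np.hstack([numeric, categorical])

# Synthetic target standing in for one element's background concentration.
y = (20 + 0.3 * numeric[:, 2] + 0.004 * numeric[:, 0]
     + 3.0 * categorical[:, 0] + rng.normal(0, 2, n))

# Scale numeric columns and one-hot encode categorical ones; force dense output.
preprocess = ColumnTransformer(
    [
        ("num", StandardScaler(), [0, 1, 2]),
        ("cat", OneHotEncoder(handle_unknown="ignore"), [3, 4, 5, 6, 7, 8]),
    ],
    sparse_threshold=0,
)

models = {
    "DT": DecisionTreeRegressor(max_depth=5, random_state=0),
    "RF": RandomForestRegressor(n_estimators=100, random_state=0),
    "GB": GradientBoostingRegressor(random_state=0),  # stand-in for XGBoost
    "SVM": SVR(kernel="rbf", C=10.0),
}

# Score every model with the same 5-fold split so the comparison is fair.
cv = KFold(n_splits=5, shuffle=True, random_state=0)
scores = {}
for name, model in models.items():
    pipe = make_pipeline(preprocess, model)
    scores[name] = cross_val_score(pipe, X, y, cv=cv, scoring="r2").mean()

for name, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(f"{name}: mean cross-validated R^2 = {score:.3f}")
```

In a real application this loop would be repeated for each of the eleven elements, which is how a ranking such as "SVM best except for Cd and Mn" could arise; the ranking produced on this synthetic target says nothing about the paper's results.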
Keywords: soil element background concentration; heavy metals; principal component analysis; machine learning model; predictive accuracy
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56
Date: 2025
Downloads: (external link)
https://www.mdpi.com/2071-1050/17/17/7853/pdf (application/pdf)
https://www.mdpi.com/2071-1050/17/17/7853/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:17:y:2025:i:17:p:7853-:d:1738780
Sustainability is currently edited by Ms. Alexandra Wu
Bibliographic data for series maintained by MDPI Indexing Manager.