Enhanced Spatially Explicit Modeling of Soil Particle Size and Texture Classification Using a Novel Two-Point Machine Learning Hybrid Framework

Liya Qin, Zong Wang and Xiaoyuan Zhang
Additional contact information
Liya Qin: Precision Forestry Key Laboratory of Beijing, College of Forestry, Beijing Forestry University, Beijing 100083, China
Zong Wang: Precision Forestry Key Laboratory of Beijing, College of Forestry, Beijing Forestry University, Beijing 100083, China
Xiaoyuan Zhang: Business School, Beijing Technology and Business University, Beijing 100048, China

Agriculture, 2025, vol. 15, issue 19, 1-29

Abstract: Accurately predicting soil particle size fractions (PSFs) and classifying soil texture types are essential for soil resource assessment and sustainable land management. PSFs, comprising clay, silt, and sand, form a compositional dataset constrained to sum to 100%. The practical implications of incorporating compositional data characteristics into PSF mapping remain insufficiently explored. This study applies a two-point machine learning (TPML) model, integrating spatial autocorrelation and attribute similarity, to enhance both the quantitative prediction of PSFs and the categorical classification of soil texture types in the Heihe River Basin, China. TPML was compared with random forest regression kriging (RFRK), random forest (RF), XGBoost, and ordinary kriging (OK), and a novel TPML-C model was developed for multi-class classification tasks. Results show that TPML achieved R² values of 0.58, 0.55, and 0.64 for clay, silt, and sand, respectively. Among all models, the ALR_TPML predictions showed the most consistent agreement with the observed variability, with predicted ranges of 2.63–98.28% for silt, 0.26–36.16% for clay, and 0.64–96.90% for sand. Across all models, the dominant soil texture types were identified as Sandy Loam (SaLo), Loamy Sand (LoSa), and Silty Loam (SiLo). For soil texture classification, TPML with raw, ALR-, and ILR-transformed data reached right ratios of 61.09%, 55.78%, and 60.00%, correctly identifying 25, 26, and 27 types out of 43. TPML with raw data exhibited strong performance in both regression and classification, with superior ability to separate ambiguous boundaries. Log-ratio transformations, particularly ILR, further improved classification performance by addressing the constraints of compositional data. These findings demonstrate the promise of hybrid machine learning approaches for digital soil mapping and precision agriculture.
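
Illustrative note: the abstract refers to log-ratio transformations of PSFs, which must sum to 100%. The following is a minimal sketch of the additive log-ratio (ALR) transform and its inverse, not the authors' implementation. The function names `alr` and `alr_inverse`, the (clay, silt, sand) column order, and the choice of sand as the reference part are assumptions made for illustration only; the abstract does not state which denominator was used.

```python
import numpy as np

def alr(psf, ref=2):
    """Additive log-ratio transform.

    psf: array of shape (n, 3) with rows (clay, silt, sand) in percent
         (column order is an assumption for this sketch).
    ref: index of the reference part used as the denominator
         (sand here, chosen only for illustration).
    """
    psf = np.asarray(psf, dtype=float)
    parts = psf / psf.sum(axis=1, keepdims=True)   # rescale rows to sum to 1
    others = np.delete(parts, ref, axis=1)         # the non-reference parts
    return np.log(others / parts[:, [ref]])        # shape (n, 2)

def alr_inverse(y, ref=2):
    """Back-transform ALR coordinates to percentages summing to 100."""
    y = np.asarray(y, dtype=float)
    # The reference part has ratio 1 to itself, so insert a column of ones.
    expanded = np.insert(np.exp(y), ref, 1.0, axis=1)
    return 100.0 * expanded / expanded.sum(axis=1, keepdims=True)

samples = np.array([[20.0, 30.0, 50.0],
                    [5.0, 15.0, 80.0]])
coords = alr(samples)            # unconstrained values a regressor can predict
restored = alr_inverse(coords)   # rows sum to 100 again by construction
```

A model trained on `coords` can predict any real values, and the back-transform guarantees that the three fractions are positive and sum to 100%. The transform is undefined when a fraction is exactly zero, so such samples would need separate handling.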

Keywords: digital soil mapping; soil particle size fractions; machine learning; soil texture; two-point machine learning model
JEL-codes: Q1 Q10 Q11 Q12 Q13 Q14 Q15 Q16 Q17 Q18
Date: 2025

Downloads:
https://www.mdpi.com/2077-0472/15/19/2008/pdf (application/pdf)
https://www.mdpi.com/2077-0472/15/19/2008/ (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:gam:jagris:v:15:y:2025:i:19:p:2008-:d:1758504

Agriculture is currently edited by Ms. Leda Xuan

Bibliographic data for series maintained by MDPI Indexing Manager.

 
Page updated 2025-09-26
Handle: RePEc:gam:jagris:v:15:y:2025:i:19:p:2008-:d:1758504