EconPapers    
Economics at your fingertips  
 

Feature Selection and Regression Models for Multisource Data-Based Soil Salinity Prediction: A Case Study of Minqin Oasis in Arid China

Sheshu Zhang, Jun Zhao (), Jianxia Yang, Jinfeng Xie and Ziyun Sun
Additional contact information
Sheshu Zhang: College of Geography and Environment Science, Northwest Normal University, Lanzhou 730070, China
Jun Zhao: College of Geography and Environment Science, Northwest Normal University, Lanzhou 730070, China
Jianxia Yang: College of Geography and Environment Science, Northwest Normal University, Lanzhou 730070, China
Jinfeng Xie: College of Geography and Environment Science, Northwest Normal University, Lanzhou 730070, China
Ziyun Sun: College of Geography and Environment Science, Northwest Normal University, Lanzhou 730070, China

Land, 2024, vol. 13, issue 6, 1-21

Abstract: (1) Monitoring salinized soil in saline–alkali land is essential, requiring regional-scale soil salinity inversion. This study aims to identify sensitive variables for predicting electrical conductivity (EC) in soil, focusing on effective feature selection methods. (2) The study systematically selects a feature subset from Sentinel-1 C SAR, Sentinel-2 MSI, and SRTM DEM data. Various feature selection methods (correlation analysis, LASSO, RFE, and GRA) are employed on 79 variables. Regression models using random forest regression (RF) and partial least squares regression (PLSR) algorithms are constructed and compared. (3) The results highlight the effectiveness of the RFE algorithm in reducing model complexity. The model incorporates significant environmental factors like soil moisture, topography, and soil texture, which play an important role in modeling. Combining the method with RF improved soil salinity prediction (R 2 = 0.71, RMSE = 1.47, RPD = 1.84). Overall, salinization in Minqin oasis soils was evident, especially in the unutilized land at the edge of the oasis. (4) Integrating data from different sources to construct characterization variables overcomes the limitations of a single data source. Variable selection is an effective means to address the redundancy of variable information, providing insights into feature engineering and variable selection for soil salinity estimation in arid and semi-arid regions.

Keywords: remote sensing; soil salinization; feature subset selection; environmental covariate; red edge; texture feature (search for similar items in EconPapers)
JEL-codes: Q15 Q2 Q24 Q28 Q5 R14 R52 (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2073-445X/13/6/877/pdf (application/pdf)
https://www.mdpi.com/2073-445X/13/6/877/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jlands:v:13:y:2024:i:6:p:877-:d:1416867

Access Statistics for this article

Land is currently edited by Ms. Carol Ma

More articles in Land from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jlands:v:13:y:2024:i:6:p:877-:d:1416867