The Influence of Non-Landslide Sample Selection Methods on Landslide Susceptibility Prediction
Yu Fu,
Zhihao Fan,
Xiangzhi Li (),
Pengyu Wang,
Xiaoyue Sun,
Yu Ren and
Wengeng Cao ()
Additional contact information
Yu Fu: Henan Provincial Key Laboratory of Hydrosphere and Watershed Water Security, North China University of Water Resources and Electric Power, Zhengzhou 450011, China
Zhihao Fan: Henan Provincial Key Laboratory of Hydrosphere and Watershed Water Security, North China University of Water Resources and Electric Power, Zhengzhou 450011, China
Xiangzhi Li: The Institute of Hydrogeology and Environmental Geology, CAGS, Shijiazhuang 050061, China
Pengyu Wang: Henan Provincial Key Laboratory of Hydrosphere and Watershed Water Security, North China University of Water Resources and Electric Power, Zhengzhou 450011, China
Xiaoyue Sun: The Institute of Hydrogeology and Environmental Geology, CAGS, Shijiazhuang 050061, China
Yu Ren: The Institute of Hydrogeology and Environmental Geology, CAGS, Shijiazhuang 050061, China
Wengeng Cao: The Institute of Hydrogeology and Environmental Geology, CAGS, Shijiazhuang 050061, China
Land, 2025, vol. 14, issue 4, 1-21
Abstract:
Non-landslide sample selection is critical in landslide susceptibility modeling due to its direct impact on model accuracy and reliability. This study compares three sample selection strategies: whole-region random selection, landslide buffer zone selection, and the enhanced information value (EIV) method. By integrating these methods with the random forest (RF) algorithm, three models—random-RF, buffer zone-RF, and EIV-RF—were developed and evaluated. Using Henan Province as a case study, 20 environmental factors and 1021 landslide records were analyzed. The EIV method leverages machine learning to assign adaptive weights to influencing factors, prioritizing sample selection in low-susceptibility regions and avoiding high-susceptibility areas, thereby enhancing sample quality. Among the models, EIV-RF achieved the highest performance, with an AUC of 0.93, an accuracy of 85.31%, and a Kappa coefficient of 0.74. Additionally, the EIV method identified smaller, more concentrated high-susceptibility zones, covering 87.37% of historical landslide points, compared to the larger, less precise zones predicted by other methods. This study highlights the effectiveness of the EIV method in refining non-landslide sample selection and improving landslide susceptibility prediction, providing valuable insights for disaster risk reduction and land use planning.
Keywords: landslide; susceptibility assessment; machine learning; non-landslide sample; enhanced information value (search for similar items in EconPapers)
JEL-codes: Q15 Q2 Q24 Q28 Q5 R14 R52 (search for similar items in EconPapers)
Date: 2025
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2073-445X/14/4/722/pdf (application/pdf)
https://www.mdpi.com/2073-445X/14/4/722/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jlands:v:14:y:2025:i:4:p:722-:d:1622277
Access Statistics for this article
Land is currently edited by Ms. Carol Ma
More articles in Land from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().