Optimizing the Sample Selection of Machine Learning Models for Landslide Susceptibility Prediction Using Information Value Models in the Dabie Mountain Area of Anhui, China
Yanrong Liu,
Zhongqiu Meng,
Lei Zhu,
Di Hu and
Handong He ()
Additional contact information
Yanrong Liu: School of Resources and Environment, Anhui Agricultural University, Hefei 230036, China
Zhongqiu Meng: School of Resources and Environment, Anhui Agricultural University, Hefei 230036, China
Lei Zhu: School of Economics and Management, Beihang University, Beijing 100191, China
Di Hu: Key Laboratory of Virtual Geographic Environment, Nanjing Normal University, Ministry of Education, Nanjing 210023, China
Handong He: School of Resources and Environment, Anhui Agricultural University, Hefei 230036, China
Sustainability, 2023, vol. 15, issue 3, 1-23
Abstract:
The evaluation of landslide susceptibility is of great significance in the prevention and management of geological hazards. The accuracy of the landslide susceptibility prediction model based on machine learning is significantly higher than that of traditional expert knowledge and the conventional mathematical statistics model. The correct and reasonable selection of non-landslide samples in the machine learning model greatly improves the prediction accuracy and reliability of the regional landslide susceptibility model. Focusing on the problem of selecting non-landslide samples in the machine learning model for landslide susceptibility evaluation, this paper proposes a landslide susceptibility evaluation method based on the combination of an information model and machine learning in traditional mathematical statistics. First, the influence factors for landslide susceptibility evaluation are screened by the correlation analysis method. Second, the information value model is used to delimit areas with low and relatively low landslide susceptibility, and non-landslide points are randomly selected. Third, a landslide susceptibility evaluation method combined with IV-ML, such as logistic regression (IV-LR), random forest (IV-RF), support vector machine (IV-SVM), and artificial neural network (IV-ANN), is established. Finally, the landslide susceptibility factors in the Dabie Mountain area of Anhui Province are analyzed, and the accuracy of the landslide susceptibility evaluation results using the IV-LR, IV-RF, IV-SVM, and IV-ANN and LR, RF, SVM, and ANN methods are compared. The accuracy is evaluated by examining the ACC, AUC, and kappa values of the model. The results indicate that the evaluation effect of the IV-ML models (IV-LR, IV-RF, IV-SVM, IV-ANN) on landslide susceptibility is significantly higher than that of the ML models (LR, RF, SVM, ANN).
Keywords: machine learning models; landslide susceptibility prediction; information value models; non-landslide unit (sample) (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (5)
Downloads: (external link)
https://www.mdpi.com/2071-1050/15/3/1971/pdf (application/pdf)
https://www.mdpi.com/2071-1050/15/3/1971/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:15:y:2023:i:3:p:1971-:d:1041887
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().