Addressing overfitting and overestimation challenges in landslide susceptibility modeling: a case study of Penang Island, Malaysia
Dorothy Anak Martin Atok () and
Soo See Chai ()
Additional contact information
Dorothy Anak Martin Atok: University of Malaysia Sarawak
Soo See Chai: University of Malaysia Sarawak
Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, 2025, vol. 121, issue 11, No 48, 13577-13604
Abstract:
Abstract In the realm of landslide susceptibility prediction, the challenge of overfitting and overestimation has persisted despite various modeling attempts. This study aims to elevate the predictive capabilities of the Extreme Gradient Boosting (XGBoost) and Random Forest (RF) models for landslide susceptibility assessment through the innovative application of Bayesian Optimization (BO). Using data from Penang Island in Malaysia, we comprehensively incorporated topographical, hydrological, human, and environmental factors influencing landslides. Leveraging Geographic Information System (GIS) tools, we meticulously constructed spatial databases encompassing all pertinent landslide conditioning elements. Our findings unveil the remarkable performance of the optimized XGBoost model, achieving an astounding 100.0% Success Rate (SR) and an impressive 97.1% Prediction Rate (PR). In comparison, the optimized RF model achieved an SR of 99.7% and a PR of 96.3%, while the stacked models followed closely with an SR of 96.8% and a PR of 95.6%. These conclusive results underscore the transformative potential of addressing overfitting and overestimation challenges through the strategic combination of stacking and hyperparameter optimization. The improved accuracy of these algorithms bears immense significance, extending to applications in site selection, engineering structure health monitoring, and disaster mitigation, thus elevating the importance of Landslide Susceptibility Maps (LSMs) in safeguarding communities and infrastructure.
Keywords: Extreme gradient boosting; Geographic information system; Hybrid; Landslide susceptibility; Random forest (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s11069-025-07329-6 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:nathaz:v:121:y:2025:i:11:d:10.1007_s11069-025-07329-6
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11069
DOI: 10.1007/s11069-025-07329-6
Access Statistics for this article
Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards is currently edited by Thomas Glade, Tad S. Murty and Vladimír Schenk
More articles in Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards from Springer, International Society for the Prevention and Mitigation of Natural Hazards
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().