A Positive-Unlabeled Learning Algorithm for Urban Flood Susceptibility Modeling
Wenkai Li,
Yuanchi Liu,
Ziyue Liu,
Zhen Gao,
Huabing Huang and
Weijun Huang
Additional contact information
Wenkai Li: School of Geography and Planning, Sun Yat-Sen University, Guangzhou 510006, China
Yuanchi Liu: School of Geography and Planning, Sun Yat-Sen University, Guangzhou 510006, China
Ziyue Liu: School of Geography and Planning, Sun Yat-Sen University, Guangzhou 510006, China
Zhen Gao: Guangzhou Institute of Geography, Guangdong Academy of Sciences, Guangzhou 510070, China
Huabing Huang: School of Geography and Planning, Sun Yat-Sen University, Guangzhou 510006, China
Weijun Huang: School of Geography and Planning, Sun Yat-Sen University, Guangzhou 510006, China
Land, 2022, vol. 11, issue 11, 1-17
Abstract:
Flood susceptibility modeling helps clarify the relationship between influencing factors and the occurrence of urban flooding, and it provides the spatial distribution of flood risk, which is critical for flood-risk reduction. Machine learning methods have been widely applied in flood susceptibility modeling, but traditional supervised learning requires both positive (flood) and negative (non-flood) samples for model training. Historical flood inventory data usually contain only positive data, whereas negative data selected from areas without flood records are prone to contamination by positive data, a situation referred to as case-control sampling with contaminated controls. To address this problem, we propose applying a novel positive-unlabeled learning algorithm, namely positive and background learning with constraints (PBLC), to flood susceptibility modeling. PBLC trains a binary classifier from case-control positive and unlabeled samples without requiring truly labeled negative data. Using historical records of flood locations and environmental covariates, including elevation, slope, aspect, plan curvature, profile curvature, slope length factor, stream power index, topographic position index, topographic wetness index, distance to rivers, distance to roads, land use, normalized difference vegetation index, and precipitation, we compared the performance of a traditional artificial neural network (ANN) and the novel PBLC in flood susceptibility modeling in the city of Guangzhou, China. Experimental results show that PBLC produces better-calibrated probabilistic predictions, more accurate binary predictions, and more reliable susceptibility maps of urban flooding than the traditional ANN, indicating that PBLC is effective in addressing the problem of case-control sampling with contaminated controls and can be successfully applied in urban flood susceptibility mapping.
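To illustrate why contaminated controls distort probabilistic predictions, the sketch below applies a generic Bayes-rule correction for case-control positive and unlabeled samples. It is not the PBLC algorithm, whose details the abstract does not give. It assumes that a classifier has been trained to separate flood samples from unlabeled background samples, and that the overall flood prevalence is known. The function name and all parameters are hypothetical.

```python
def corrected_probability(naive_prob, n_pos, n_unl, prevalence):
    """Estimate P(flood | x) from a positive-vs-unlabeled classifier score.

    naive_prob : score of a classifier trained on positive vs. unlabeled
                 samples, i.e. an estimate of P(sample is labeled | x).
    n_pos      : number of positive (flood) training samples.
    n_unl      : number of unlabeled (background) training samples.
    prevalence : assumed overall fraction of flood locations (an assumption
                 of this sketch, not a value from the paper).
    """
    if not 0.0 <= naive_prob < 1.0:
        raise ValueError("naive_prob must lie in [0, 1)")
    if n_pos <= 0 or n_unl <= 0:
        raise ValueError("sample counts must be positive")
    if not 0.0 <= prevalence <= 1.0:
        raise ValueError("prevalence must lie in [0, 1]")

    # Under case-control sampling, the naive odds equal
    # (n_pos / n_unl) * p(x | flood) / p(x), so the density ratio is:
    odds = naive_prob / (1.0 - naive_prob)
    density_ratio = (n_unl / n_pos) * odds

    # Bayes' rule: P(flood | x) = prevalence * p(x | flood) / p(x).
    # Clip at 1.0 because noisy scores can push the estimate above 1.
    return min(1.0, prevalence * density_ratio)


if __name__ == "__main__":
    # A naive score of 0.5 does not mean a 50% flood probability.
    print(corrected_probability(0.5, n_pos=100, n_unl=100, prevalence=0.2))
```

With equal numbers of positive and unlabeled samples, a naive score of 0.5 corresponds only to the assumed prevalence, which shows why a classifier that treats unlabeled samples as true negatives is poorly calibrated.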
Keywords: urban flooding; susceptibility; machine learning; positive data; unlabeled data
JEL-codes: Q15 Q2 Q24 Q28 Q5 R14 R52
Date: 2022
Downloads:
https://www.mdpi.com/2073-445X/11/11/1971/pdf (application/pdf)
https://www.mdpi.com/2073-445X/11/11/1971/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jlands:v:11:y:2022:i:11:p:1971-:d:962994
Land is currently edited by Ms. Carol Ma
Bibliographic data for series maintained by MDPI Indexing Manager.