Flood Susceptibility Assessment with Random Sampling Strategy in Ensemble Learning (RF and XGBoost)

Author:

Ren Hancheng12,Pang Bo12,Bai Ping3,Zhao Gang4ORCID,Liu Shu56,Liu Yuanyuan56,Li Min56

Affiliation:

1. College of Water Sciences, Beijing Normal University, Beijing 100875, China

2. Beijing Key Laboratory of Urban Hydrological Cycle and Sponge City Technology, Beijing 100875, China

3. Kunming Flood Control and Drought Relief Headquarters Office, Kunming 650000, China

4. Institute of Industrial Science, University of Tokyo, Tokyo 153−8505, Japan

5. China Institute of Water Resources and Hydropower Research, Beijing 100038, China

6. Research Center on Flood and Drought Disaster Reduction of the Ministry of Water Resources, Beijing 100038, China

Abstract

Due to the complex interaction of urban and mountainous floods, assessing flood susceptibility in mountainous urban areas presents a challenging task in environmental research and risk analysis. Data-driven machine learning methods can evaluate flood susceptibility in mountainous urban areas lacking essential hydrological data, utilizing remote sensing data and limited historical inundation records. In this study, two ensemble learning algorithms, Random Forest (RF) and XGBoost, were adopted to assess the flood susceptibility of Kunming, a typical mountainous urban area prone to severe flood disasters. A flood inventory was created using flood observations from 2018 to 2022. The spatial database included 10 explanatory factors, encompassing climatic, geomorphic, and anthropogenic factors. Artificial Neural Network (ANN) and Support Vector Machine (SVM) were selected for model comparison. To minimize the influence of expert opinions on model training, this study employed a strategy of uniformly random sampling in historically non-flooded areas for negative sample selection. The results demonstrated that (1) ensemble learning algorithms offer higher accuracy than other machine learning methods, with RF achieving the highest accuracy, evidenced by an area under the curve (AUC) of 0.87, followed by XGBoost at 0.84, surpassing both ANN (0.83) and SVM (0.82); (2) the interpretability of ensemble learning highlighted the differences in the potential distribution of the training data’s positive and negative samples. Feature importance in ensemble learning can be utilized to minimize human bias in the collection of flooded-site samples, more targeted flood susceptibility maps of the study area’s road network were obtained; and (3) ensemble learning algorithms exhibited greater stability and robustness in datasets with varied negative samples, as evidenced by their performance in F1-Score, Kappa, and AUC metrics. This paper further substantiates the superiority of ensemble learning in flood susceptibility assessment tasks from the perspectives of accuracy, interpretability, and robustness, enhances the understanding of the impact of negative samples on such assessments, and optimizes the specific process for urban flood susceptibility assessment using data-driven methods.

Funder

National Key Research and Development Program of China

National Natural Science Foundation of China

Beijing Normal University Interdisciplinary Fund Project

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Reference65 articles.

1. Hydro-Meteorological Disasters: Causes, Effects and Mitigation Measures with Special Reference to Early Warning with Data Driven Approaches of Forecasting;Jayawardena;Procedia IUTAM,2015

2. Urban Flood Impact Assessment: A State-of-the-Art Review;Hammond;Urban Water J.,2015

3. A Review of the Current Status of Flood Modelling for Urban Flood Risk Management in the Developing Countries;Nkwunonwo;Sci. Afr.,2020

4. Drivers of Future Urban Flood Risk;Thorne;Philos. Trans. R. Soc. A,2020

5. The Impacts of Urbanisation and Climate Change on Urban Flooding and Urban Water Quality: A Review of the Evidence Concerning the United Kingdom;Miller;J. Hydrol. Reg. Stud.,2017

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3