Affiliation:
1. Institute of Computer and Information Science, Chongqing Normal University, Chongqing 400047, China
2. Key Laboratory for Digital Land and Resources of Jiangxi Province, East China University of Technology, Nanchang 330013, China
3. School of Information and Engineering, East China University of Technology, Nanchang 330013, China
Abstract
Landslide susceptibility assessment (LSA) based on machine learning methods has been widely used in landslide geological hazard management and research. However, the problem of sample imbalance in landslide susceptibility assessment, where landslide samples tend to be much smaller than non-landslide samples, is often overlooked. This problem is often one of the important factors affecting the performance of landslide susceptibility models. In this paper, we take the Wanzhou district of Chongqing city as an example, where the total number of data sets is more than 580,000 and the ratio of positive to negative samples is 1:19. We oversample or undersample the unbalanced landslide samples to make them balanced, and then compare the performance of machine learning models with different sampling strategies. Three classic machine learning algorithms, logistic regression, random forest and LightGBM, are used for LSA modeling. The results show that the model trained directly using the unbalanced sample dataset performs the worst, showing an extremely low recall rate, indicating that its predictive ability for landslide samples is extremely low and cannot be applied in practice. Compared with the original dataset, the sample set optimized through certain methods has demonstrated improved predictive performance across various classifiers, manifested in the improvement of AUC value and recall rate. The best model was the random forest model using over-sampling (O_RF) (AUC = 0.932).
Funder
Engineering Research Center for Seismic Disaster Prevention
Key Laboratory for Digital Land and Resources of Jiangxi Province, East China University of Technology
Science and Technology Research Proiect of Jiangxi Provincial Department of Education
Education science planning project at the university level of East China University of Technology
Subject
Earth and Planetary Sciences (miscellaneous),Computers in Earth Sciences,Geography, Planning and Development
Reference58 articles.
1. Machine learning methods for landslide susceptibility studies: A comparative overview of algorithm performance;Merghadi;Earth-Sci. Rev.,2020
2. Deep learning-based landslide susceptibility mapping;Azarafza;Sci. Rep.,2021
3. Nikoobakht, S., Azarafza, M., Akgün, H., and Derakhshani, R. (2022). Landslide susceptibility assessment by using convolutional neural network. Appl. Sci., 12.
4. Machine learning for predicting landslide risk of Rohingya refugee camp infrastructure;Ahmed;J. Inf. Telecommun.,2020
5. A comparative study between popular statistical and machine learning methods for simulating volume of landslides;Shirzadi;Catena,2017
Cited by
16 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献