Landslide susceptibility prediction modelling based on semi‐supervised XGBoost model

Author:

Shua Qiangqiang1,Peng Hongbin2,Li Jingkai3

Affiliation:

1. Building Environment Engineering Technology Research Center Sichuan Institute of Arts and Sciences Dazhou Sichuan China

2. Sichuan Institute of Arts and Sciences Dazhou Sichuan China

3. China HuaXi Engineering Design and Construction Limited Company Chongzhou Sichuan China

Abstract

In the process of landslide susceptibility prediction (LSP) modelling, there are some problems in the model dataset relating to landslide and non‐landslide samples, such as landslide sample errors, subjective randomness and low accuracy of non‐landslide sample selection. In order to solve the above problems, a semi‐supervised machine learning model for LSP is innovatively proposed. Firstly, Yanchang County of Shanxi Province, China, is taken as the study area. Secondly, the frequency ratio values of 12 environmental factors (elevation, slope, aspect, etc.) and the randomly selected twice non‐landslides are used to form the initial model datasets. Thirdly, an extreme gradient boosting (XGBoost) model is adopted for training and testing the initial datasets, so as to produce initial landslide susceptibility maps (LSMs) which are divided into very low, low, moderate, high and very high susceptibility levels. Next, the landslide samples in initial LSMs with very low and low susceptibility levels are excluded to improve the accuracy of landslide samples, and the unlabelled twice non‐landslide samples in initial LSMs with low and very low susceptibility levels are randomly selected to ensure the accuracy of non‐landslide samples. These new obtained landslide and non‐landslide samples are reimported into XGBoost model to construct the semi‐supervised XGBoost (SSXGBoost) model. Finally, accuracy, kappa coefficient and statistical indexes of susceptibility indexes are adopted to assess the LSP performance of XGBoost and SSXGBoost models. Results show that SSXGBoost model has remarkably better LSP performance than that of XGBoost model. Conclusively, the proposed SSXGBoost model effectively overcomes the problems that the accuracy of landslide samples needs to be further improved and that non‐landslide samples are difficult to select accurately.

Publisher

Wiley

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3