Using machine learning algorithms to identify predictors of social vulnerability in the event of a hazard: Istanbul case study

Author:

Kalaycıoğlu Oya, Akhanlı Serhat Emre, Menteşe Emin Yahya, Kalaycıoğlu Mehmet, Kalaycıoğlu Sibel

Abstract

The extent to which an individual or group is affected by the damage of a hazard depends not just on their exposure to the event but also on their social vulnerability, that is, how well they are able to anticipate, cope with, resist, and recover from the impact of a hazard. Therefore, to mitigate disaster risk effectively and build a society that is resilient to natural hazards, it is essential that policy makers develop an understanding of social vulnerability. This study aims to propose an optimal predictive model that allows decision makers to identify households with high social vulnerability by using a number of easily accessible household variables. To develop such a model, we rely on a large dataset from a household survey (n = 41 093) that was conducted to generate a social vulnerability index (SoVI) in Istanbul, Türkiye. We assessed the ability of socio-economic, socio-demographic, and housing-condition variables to predict household-level social vulnerability through machine learning models. We used classification and regression tree (CART), random forest (RF), support vector machine (SVM), naïve Bayes (NB), artificial neural network (ANN), k-nearest neighbours (KNN), and logistic regression to classify households by their social vulnerability level, which served as the outcome of these models. Because the outcome classes differed in size, subsampling strategies were applied to deal with the imbalanced data. Among these models, ANN had the best predictive performance for discriminating between households with low and high social vulnerability when random majority undersampling was applied (area under the curve (AUC): 0.813). The results from the ANN method indicated that lack of social security, living in a squatter house, and job insecurity were among the most important predictors of social vulnerability to hazards. Additionally, the level of education, the ratio of elderly persons in the household, owning a property, household size, the ratio of income earners, and the savings of the household were found to be associated with social vulnerability. An open-access R Shiny web application was developed to visually display the performance of the machine learning (ML) methods, the variables that are important for classifying households with high and low social vulnerability, and the spatial distribution of these variables across Istanbul neighbourhoods. The machine learning methodology and the findings that we present in this paper can guide decision makers in identifying social vulnerability effectively and hence help them prioritise actions for vulnerable groups according to their needs before a hazard event.
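The two evaluation ideas named in the abstract, random majority undersampling and the area under the curve, can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names `undersample_majority` and `auc`, the 0/1 label coding, and the toy data are all assumptions made for illustration, and only the Python standard library is used.

```python
import random


def undersample_majority(rows, labels, seed=0):
    """Randomly drop majority-class rows until both classes have equal size.

    Labels are assumed to be coded 0/1 (hypothetical coding, e.g. 1 = high
    social vulnerability).
    """
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    # Keep every minority row plus an equally sized random draw of majority rows.
    kept = sorted(minority + rng.sample(majority, len(minority)))
    return [rows[i] for i in kept], [labels[i] for i in kept]


def auc(scores, labels):
    """AUC as the share of (positive, negative) pairs ranked correctly.

    A tie between a positive and a negative score counts as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one example of each class")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Toy data, invented for illustration only: 2 positives among 8 households.
    toy_rows = list(range(8))
    toy_labels = [1, 0, 0, 0, 0, 1, 0, 0]
    balanced_rows, balanced_labels = undersample_majority(toy_rows, toy_labels)
    print(balanced_rows, balanced_labels)
    print(auc([0.9, 0.2, 0.4, 0.7], [1, 0, 0, 1]))
```

In practice, undersampling would be applied only to the training portion of the data, so that performance is measured on data with the original class balance.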

Publisher

Copernicus GmbH

Subject

General Earth and Planetary Sciences
