Abstract
Web text that describes disaster events in natural language contains a considerable amount of disaster information. Automatically extracting this information (e.g., time, location, casualties, and disaster losses) from web text provides an important supplement to conventional disaster monitoring data. This study extracted earthquake disaster information from web news media reports (news reports) and online disaster reduction agency reports (professional reports) and compared its characteristics. Using earthquakes in China from 2015 to 2017 as a case study, a series of rules was created for extracting earthquake event information, including temporal extraction rules, a location trigger dictionary, and an attribute trigger dictionary. The differences between news reports and professional reports were investigated in terms of quantity and spatiotemporal distribution through statistical analysis, geocoding, and kernel density estimation. The information extracted from each set of reports was also compared with authoritative data. The results indicated that news reports are more extensive and contain richer information. In contrast, professional reports are less repetitive, more accurate, and more standardized, and focus mainly on earthquakes with Ms ≥ 4 and/or earthquakes that may cause damage. These characteristics of disaster information from different web text sources can be used to improve the efficiency and analysis of disaster information extraction. In addition, the rule-based approach proposed herein was found to be an accurate and viable way to extract earthquake information from web text. The approach provides the technical basis and background information to support further research that seeks, from web text, human-centric disaster information that cannot be acquired using traditional instrument monitoring methods.
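To make the idea of rule-based extraction concrete, the following is a minimal sketch of how temporal rules and trigger dictionaries might be combined. It is not the rule set used in the study: the trigger words, the regular expressions, the function name `extract_event`, and the English sample sentence are all assumptions made for illustration (the study itself worked with reports on earthquakes in China, and its actual rules are not reproduced here).

```python
import re

# Hypothetical attribute trigger dictionary: attribute name -> trigger words.
ATTRIBUTE_TRIGGERS = {
    "deaths": ["killed", "dead"],
    "injuries": ["injured", "hurt"],
}

# Hypothetical location triggers: phrases assumed to precede a place name.
LOCATION_TRIGGERS = ["struck", "hit", "occurred in"]

# Hypothetical temporal rule (ISO-style dates only) and magnitude rule.
DATE_RULE = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")
MAGNITUDE_RULE = re.compile(r"\bM[sS]?\s?(\d+(?:\.\d+)?)")


def extract_event(text):
    """Apply each rule to the text; fields with no match stay None."""
    event = {"date": None, "magnitude": None, "location": None}

    date = DATE_RULE.search(text)
    if date:
        event["date"] = date.group(0)

    magnitude = MAGNITUDE_RULE.search(text)
    if magnitude:
        event["magnitude"] = float(magnitude.group(1))

    # A location is taken as the capitalized words right after a trigger.
    for trigger in LOCATION_TRIGGERS:
        pattern = re.escape(trigger) + r"\s+([A-Z][a-z]+(?:\s[A-Z][a-z]+)*)"
        match = re.search(pattern, text)
        if match:
            event["location"] = match.group(1)
            break

    # An attribute value is the number that directly precedes a trigger word.
    for attribute, triggers in ATTRIBUTE_TRIGGERS.items():
        event[attribute] = None
        for trigger in triggers:
            pattern = r"(\d+)\s+(?:people\s+)?(?:were\s+)?" + re.escape(trigger)
            match = re.search(pattern, text)
            if match:
                event[attribute] = int(match.group(1))
                break

    return event


# Invented sentence for illustration only; it is not from the study's corpus.
sample = ("On 2016-01-01 an Ms 5.2 earthquake struck Example County; "
          "3 people were killed and 12 injured.")
print(extract_event(sample))
```

A dictionary-driven design of this kind keeps the rules easy to inspect and extend, which is one reason rule-based extraction can remain accurate on reports that follow a fairly standardized wording.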
Funder
The Strategic Priority Research Program (Class A) of the Chinese Academy of Sciences
Subject
Earth and Planetary Sciences (miscellaneous); Computers in Earth Sciences; Geography, Planning and Development
Cited by
7 articles.