Abstract
Typhoons are major natural disasters in China. A large amount of typhoon information is contained in online media resources, such as news reports and volunteered geographic information (VGI) data, which serve as implicit data sources for typhoon research. However, two problems arise when using typhoon information from Chinese news reports. First, because the Chinese language lacks natural word delimiters, word segmentation errors result in trigger mismatches, and the polysemy of Chinese words further affects the classification of triggers. Second, there is no authoritative classification system for typhoon events. This paper defines a classification system for typhoon events and then uses it in a neural network model, a lattice-structured bidirectional long short-term memory network with a conditional random field (BiLSTM-CRF), to detect these events in Chinese online news. A typhoon dataset is created using texts from the China Weather Typhoon Network. Three other datasets are generated from general Chinese web pages. Experiments on these four datasets show that the model can tackle the problems mentioned above and accurately detect typhoon events in Chinese news reports.
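To illustrate the segmentation problem, the following minimal sketch enumerates every lexicon word that matches a span of a character sequence. This is the general idea behind a word-character lattice: a character-level tagger can see all candidate words rather than committing to a single, possibly wrong, segmentation. The function name `build_lattice`, the toy lexicon, and the `max_len` parameter are assumptions made for illustration and are not taken from the paper.

```python
def build_lattice(chars, lexicon, max_len=4):
    """Return every (start, end, word) span of `chars` whose text is in `lexicon`.

    `end` is exclusive. Only multi-character words are considered, since
    single characters are already the base units of the sequence.
    """
    spans = []
    n = len(chars)
    for start in range(n):
        # Try every candidate word of length 2..max_len starting here.
        for end in range(start + 2, min(n, start + max_len) + 1):
            word = chars[start:end]
            if word in lexicon:
                spans.append((start, end, word))
    return spans


# Toy example (hypothetical lexicon): "Typhoon makes landfall in Guangdong Province".
sentence = "台风登陆广东省"
lexicon = {"台风", "登陆", "广东", "广东省"}
lattice = build_lattice(sentence, lexicon)

for start, end, word in lattice:
    print(start, end, word)
```

Here both 广东 and 广东省 are retained as overlapping candidates, so a downstream model, rather than a segmenter, decides which reading fits the context. The candidate trigger 登陆 (landfall) is matched as a whole word regardless of how the rest of the sentence would be segmented.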
Subject
Management, Monitoring, Policy and Law; Renewable Energy, Sustainability and the Environment; Geography, Planning and Development
Cited by
4 articles.