Author:
Tao Liufeng,Xie Zhong,Xu Dexin,Ma Kai,Qiu Qinjun,Pan Shengyong,Huang Bo
Abstract
Toponym recognition, or the challenge of detecting place names that have a similar referent, is involved in a number of activities connected to geographical information retrieval and geographical information sciences. This research focuses on recognizing Chinese toponyms from social media communications. While broad named entity recognition methods are frequently used to locate places, their accuracy is hampered by the many linguistic abnormalities seen in social media posts, such as informal sentence constructions, name abbreviations, and misspellings. In this study, we describe a Chinese toponym identification model based on a hybrid neural network that was created with these linguistic inconsistencies in mind. Our method adds a number of improvements to a standard bidirectional recurrent neural network model to help with location detection in social media messages. We demonstrate the results of a wide-ranging evaluation of the performance of different supervised machine learning methods, which have the natural advantage of avoiding human design features. A set of controlled experiments with four test datasets (one constructed and three public datasets) demonstrates the performance of supervised machine learning that can achieve good results on the task, significantly outperforming seven baseline models.
Funder
National Natural Science Foundation of China
Beijing Key Laboratory of Urban Spatial Information Engineering
China Postdoctoral Science Foundation
Wuhan Multi-Element Urban Geological Survey Demonstration Project
The Hubei Key Laboratory of Intelligent Geo-Information Processing
Wuhan Science and Technology Plan Project
Subject
Earth and Planetary Sciences (miscellaneous),Computers in Earth Sciences,Geography, Planning and Development
Reference42 articles.
1. Processing social media messages in mass emergency: A survey;Imran;ACM Comput. Surv.,2015
2. Silverman, L. (2017, September 12). Facebook, Twitter Replace 911 Calls for Stranded in Houston. Available online: https://www.npr.org/sections/alltechconsidered/2017/08/28/546831780/texas-police-and-residents-turn-to-social-media-to-communicateamid-harvey.
3. Deep learning for real-time social media text classification for situation awareness—Using hurricanes Sandy, Harvey, and Irma as case studies;Yu;Int. J. Digit. Earth,2019
4. A natural language processing and geospatial clustering framework for harvesting local place names from geotagged housing advertisements;Hu;Int. J. Geogr. Inf. Sci.,2018
5. Freire, N., Borbinha, J., Calado, P., and Martins, B. (2011, January 13–17). A metadata geoparsing system for place name recognition and resolution in metadata records. Proceedings of the 11th International ACM/IEEE Joint Conference on Digital Libraries, Ottawa, ON, Canada.
Cited by
13 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献