Location Extraction from Social Media-Reference-Cited by-同舟云学术

Location Extraction from Social Media

Published:2018-10-31 Issue:4 Volume:36 Page:1-27
ISSN:1046-8188
Container-title:ACM Transactions on Information Systems
language:en
Short-container-title:ACM Trans. Inf. Syst.

Author:

Middleton Stuart E.¹^ORCID,Kordopatis-Zilos Giorgos²,Papadopoulos Symeon²,Kompatsiaris Yiannis²

Affiliation:

1. University of Southampton IT Innovation Centre, Southampton, UK

2. Information Technologies Institute, CERTH, Thermi-Thessaloniki, Greece

Abstract

Location extraction, also called “toponym extraction,” is a field covering geoparsing, extracting spatial representations from location mentions in text, and geotagging, assigning spatial coordinates to content items. This article evaluates five “best-of-class” location extraction algorithms. We develop a geoparsing algorithm using an OpenStreetMap database, and a geotagging algorithm using a language model constructed from social media tags and multiple gazetteers. Third-party work evaluated includes a DBpedia-based entity recognition and disambiguation approach, a named entity recognition and Geonames gazetteer approach, and a Google Geocoder API approach. We perform two quantitative benchmark evaluations, one geoparsing tweets and one geotagging Flickr posts, to compare all approaches. We also perform a qualitative evaluation recalling top N location mentions from tweets during major news events. The OpenStreetMap approach was best (F1 0.90+) for geoparsing English, and the language model approach was best (F1 0.66) for Turkish. The language model was best (F1@1km 0.49) for the geotagging evaluation. The map database was best (R@20 0.60+) in the qualitative evaluation. We report on strengths, weaknesses, and a detailed failure analysis for the approaches and suggest concrete areas for further research.

Funder

REVEAL project

InVID project

TRIDEC project

Horizon 2020 Program of the European Commission

7th Framework Program of the European Commission

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications,General Business, Management and Accounting,Information Systems

Link

https://dl.acm.org/doi/pdf/10.1145/3202662