Adaptive Geoparsing Method for Toponym Recognition and Resolution in Unstructured Text

Published: 2020-09-17 Issue: 18 Volume: 12 Page: 3041
ISSN: 2072-4292
Container-title: Remote Sensing
Language: en
Short-container-title: Remote Sensing

Author:

Aldana-Bobadilla Edwin, Molina-Villegas Alejandro, Lopez-Arevalo Ivan, Reyes-Palacios Shanel, Muñiz-Sanchez Victor, Arreola-Trapala Jean

Abstract

The automatic extraction of geospatial information is an important aspect of data mining. Computer systems capable of discovering geographic information from natural language involve a complex process called geoparsing, which includes two important tasks: geographic entity recognition and toponym resolution. The first task can be approached with machine learning, in which case a model is trained to recognize sequences of characters (words) that correspond to geographic entities. The second task consists of assigning such entities to their most likely coordinates. Frequently, the latter process involves resolving referential ambiguities. In this paper, we propose an extensible geoparsing approach that combines geographic entity recognition based on a neural network model with disambiguation based on what we call dynamic context disambiguation. Once place names are recognized in an input text, they are resolved using a grammar whose rules specify how ambiguities can be resolved from context, much as a person would resolve them. The result is an assignment of the most likely geographic properties to the recognized places. We propose an assessment measure based on a ranking of closeness between the predicted and actual locations of a place name. Under this measure, our method outperforms OpenStreetMap Nominatim. We also report measures of place-name recognition and of the prediction of what we call geographic levels (the administrative jurisdiction of places).
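The two-stage structure described in the abstract (recognize place names, then resolve each one to coordinates using context) can be illustrated with a minimal Python sketch. This is not the authors' method: the dictionary lookup stands in for the neural recognizer, the single "pick the candidate nearest to an unambiguous place in the same text" rule stands in for the grammar-based dynamic context disambiguation, and the toy `GAZETTEER`, its approximate coordinates, and all function names are hypothetical.

```python
import math

# Hypothetical toy gazetteer: place name -> candidate (label, lat, lon) entries.
# Coordinates are approximate and for illustration only.
GAZETTEER = {
    "Paris": [("Paris, France", 48.8566, 2.3522),
              ("Paris, Texas, USA", 33.6609, -95.5555)],
    "Lyon": [("Lyon, France", 45.7640, 4.8357)],
    "Dallas": [("Dallas, Texas, USA", 32.7767, -96.7970)],
}


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two lat/lon points."""
    radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * radius_km * math.asin(math.sqrt(a))


def recognize(text):
    """Stage 1 stand-in: dictionary lookup instead of a trained model."""
    tokens = [tok.strip(".,;:!?") for tok in text.split()]
    return [tok for tok in tokens if tok in GAZETTEER]


def resolve(names):
    """Stage 2 stand-in: one context rule instead of a full grammar.

    Unambiguous names act as anchors; each ambiguous name is assigned
    the candidate closest to any anchor. With no anchors, the first
    candidate is used as a fallback.
    """
    anchors = [GAZETTEER[n][0] for n in names if len(GAZETTEER[n]) == 1]
    resolved = {}
    for name in names:
        candidates = GAZETTEER[name]
        if len(candidates) == 1 or not anchors:
            resolved[name] = candidates[0]
        else:
            resolved[name] = min(
                candidates,
                key=lambda c: min(
                    haversine_km(c[1], c[2], a[1], a[2]) for a in anchors
                ),
            )
    return resolved


if __name__ == "__main__":
    for sentence in ("The train from Lyon arrived in Paris.",
                     "She drove from Dallas to Paris."):
        print(sentence, "->", resolve(recognize(sentence))["Paris"][0])
```

The same `haversine_km` helper could also measure the distance between a predicted and an actual location, which is the kind of quantity a closeness-based assessment would rank. The ranking measure proposed in the paper is not reproduced here.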

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/12/18/3041/pdf

Reference: 35 articles.

1. Report on the State of the Art of Named Entity and Word Sense Disambiguation; Aguirre, 2015

2. Every document has a geographical scope

3. A pragmatic guide to geoparsing evaluation

4. A conceptual density‐based approach for the disambiguation of toponyms

Cited by 15 articles.

1. A survey on geocoding: algorithms and datasets for toponym resolution; Language Resources and Evaluation; 2024-06-10

2. Beyond extraction accuracy: addressing the quality of geographical named entity through advanced recognition and correction models using a modified BERT framework; Geo-spatial Information Science; 2024-05-28

3. Comparative Performance of Advanced NLP Models and LLMs in Multilingual Geo-Entity Detection; Proceedings of the Cognitive Models and Artificial Intelligence Conference; 2024-05-25

4. MAWI: Mapping the Unmapped in Wikipedia via Geographic Information Extraction; Communications in Computer and Information Science; 2024

5. A Study on Toponymic Entity Recognition Based on Pre-Trained Models Fused with Local Features for Genglubu in the South China Sea; Electronics; 2023-12-19