Deep Learning for Toponym Resolution: Geocoding Based on Pairs of Toponyms-Reference-Cited by-同舟云学术

Deep Learning for Toponym Resolution: Geocoding Based on Pairs of Toponyms

Published:2021-12-02 Issue:12 Volume:10 Page:818
ISSN:2220-9964
Container-title:ISPRS International Journal of Geo-Information
language:en
Short-container-title:IJGI

Author:

Fize Jacques^ORCID,Moncla Ludovic^ORCID,Martins Bruno^ORCID

Abstract

Geocoding aims to assign unambiguous locations (i.e., geographic coordinates) to place names (i.e., toponyms) referenced within documents (e.g., within spreadsheet tables or textual paragraphs). This task comes with multiple challenges, such as dealing with referent ambiguity (multiple places with a same name) or reference database completeness. In this work, we propose a geocoding approach based on modeling pairs of toponyms, which returns latitude-longitude coordinates. One of the input toponyms will be geocoded, and the second one is used as context to reduce ambiguities. The proposed approach is based on a deep neural network that uses Long Short-Term Memory (LSTM) units to produce representations from sequences of character n-grams. To train our model, we use toponym co-occurrences collected from different contexts, namely textual (i.e., co-occurrences of toponyms in Wikipedia articles) and geographical (i.e., inclusion and proximity of places based on Geonames data). Experiments based on multiple geographical areas of interest—France, United States, Great-Britain, Nigeria, Argentina and Japan—were conducted. Results show that models trained with co-occurrence data obtained a higher geocoding accuracy, and that proximity relations in combination with co-occurrences can help to obtain a slightly higher accuracy in geographical areas with fewer places in the data sources.

Funder

Agence Nationale de la Recherche

Publisher

MDPI AG

Subject

Earth and Planetary Sciences (miscellaneous),Computers in Earth Sciences,Geography, Planning and Development

Link

https://www.mdpi.com/2220-9964/10/12/818/pdf

Reference38 articles.

1. A survey on the geographic scope of textual documents

2. Approaches to disambiguating toponyms

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Geographical and linguistic perspectives on developing geoparsers with generic resources;International Journal of Geographical Information Science;2024-06-30

2. A survey on geocoding: algorithms and datasets for toponym resolution;Language Resources and Evaluation;2024-06-10

3. GIS-based relationship between pathway names and landscape. A multilingual case study: Euskadi, Spain;GeoJournal;2024-04-29

4. A Hierarchy-Aware Geocoding Model Based on Cross-Attention within the Seq2Seq Framework;ISPRS International Journal of Geo-Information;2024-04-17

5. A Spatially-Aware Data-Driven Approach to Automatically Geocoding Non-Gazetteer Place Names;ACM Transactions on Spatial Algorithms and Systems;2023-12-11