A System for Aligning Geographical Entities from Large Heterogeneous Sources-Reference-Cited by-同舟云学术

A System for Aligning Geographical Entities from Large Heterogeneous Sources

Published:2022-01-28 Issue:2 Volume:11 Page:96
ISSN:2220-9964
Container-title:ISPRS International Journal of Geo-Information
language:en
Short-container-title:IJGI

Author:

Melo André^ORCID,Er-Rahmadi Btissam^ORCID,Pan Jeff Z.^ORCID

Abstract

Aligning points of interest (POIs) from heterogeneous geographical data sources is an important task that helps extend map data with information from different datasets. This task poses several challenges, including differences in type hierarchies, labels (different formats, languages, and levels of detail), and deviations in the coordinates. Scalability is another major issue, as global-scale datasets may have tens or hundreds of millions of entities. In this paper, we propose the GeographicaL Entities AligNment (GLEAN) system for efficiently matching large geographical datasets based on spatial partitioning with an adaptable margin. In particular, we introduce a text similarity measure based on the local-context relevance of tokens used in combination with sentence embeddings. We then come up with a scalable type embedding model. Finally, we demonstrate that our proposed system can efficiently handle the alignment of large datasets while improving the quality of alignments using the proposed entity similarity measure.

Publisher

MDPI AG

Subject

Earth and Planetary Sciences (miscellaneous),Computers in Earth Sciences,Geography, Planning and Development

Link

https://www.mdpi.com/2220-9964/11/2/96/pdf

Reference26 articles.

1. Citizens as sensors: the world of volunteered geography

2. Spatial data fusion in Spatial Data Infrastructures using Linked Data

3. Matching Points of Interest from Different Social Networking Sites

4. Finding corresponding objects when integrating several geo-spatial datasets

5. A feature-based approach to conflation of geospatial sources

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Spatial-Aware Representation Learning Model for Link Completion in GeoKG: A Case Study on Wikidata and OpenStreetMap;2023 11th International Conference on Agro-Geoinformatics (Agro-Geoinformatics);2023-07-25

2. Conflating point of interest (POI) data: A systematic review of matching methods;Computers, Environment and Urban Systems;2023-07

3. Exploring science-technology linkages: A deep learning-empowered solution;Information Processing & Management;2023-03