A Semantic-Spatial Aware Data Conflation Approach for Place Knowledge Graphs
-
Published:2024-03-22
Issue:4
Volume:13
Page:106
-
ISSN:2220-9964
-
Container-title:ISPRS International Journal of Geo-Information
-
language:en
-
Short-container-title:IJGI
Author:
He Lianlian1, Li Hao2, Zhang Rui2
Affiliation:
1. School of Mathematics and Statistics, Hubei University of Education, No. 129 Second Gaoxin Road, East Lake Hi-Tech Zone, Wuhan 430205, China 2. School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430072, China
Abstract
Recent advances in knowledge graphs show great promise to link various data together to provide a semantic network. Place is an important part in the big picture of the knowledge graph since it serves as a powerful glue to link any data to its georeference. A key technical challenge in constructing knowledge graphs with location nodes as geographical references is the matching of place entities. Traditional methods typically rely on rule-based matching or machine-learning techniques to determine if two place names refer to the same location. However, these approaches are often limited in the feature selection of places for matching criteria, resulting in imbalanced consideration of spatial and semantic features. Deep feature-based methods such as deep learning methods show great promise for improved place data conflation. This paper introduces a Semantic-Spatial Aware Representation Learning Model (SSARLM) for Place Matching. SSARLM liberates the tedious manual feature extraction step inherent in traditional methods, enabling an end-to-end place entity matching pipeline. Furthermore, we introduce an embedding fusion module designed for the unified encoding of semantic and spatial information. In the experiment, we evaluate the approach to named places from Guangzhou and Shanghai cities in GeoNames, OpenStreetMap (OSM), and Baidu Map. The SSARLM is compared with several classical and commonly used binary classification machine learning models, and the state-of-the-art large language model, GPT-4. The results demonstrate the benefit of pre-trained models in data conflation of named places.
Funder
Educational Commission of Hubei Province of China
Reference38 articles.
1. Thakuriah, P., Tilahun, N.Y., and Zellner, M. (2017). Seeing Cities through Big Data: Research, Methods and Applications in Urban Informatics, Springer. 2. Manville, C., Cochrane, G., Jonathan, C.A.V.E., Millard, J., Pederson, J.K., Thaarup, R.K., WiK, J.K., and WiK, M.W. (2014). Mapping Smart Cities in the EU, European Parliamentary Research Service. 3. Allemang, D., and Hendler, J. (2011). Semantic Web for the Working Ontologist: Effective Modeling in RDFS and OWL, Elsevier. 4. Liu, J., Guo, D., Liu, G., Zhao, Y., Yang, W., and Tang, L. (2022, January 21–23). Construction Method of City-Level Geographic Knowledge Graph Based on Geographic Entity. Proceedings of the International Conference on Geoinformatics and Data Analysis, ICGDA 2022, Paris, France. 5. Kuhn, W., Kauppinen, T., and Janowicz, K. (2014, January 24–26). Linked data-a paradigm shift for geographic information science. Proceedings of the Geographic Information Science: 8th International Conference, GIScience 2014, Vienna, Austria.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|