Affiliation:
1. North China Electric Power University, China
Abstract
This article describes how geographic information systems (GISs) can enable, enrich and enhance geospatial applications and services. Accurate calculation of the similarity among geospatial entities that belong to different data sources is of great importance for geospatial data linking. At present, most research works use the name or category of the entity to measure the similarity of geographic information. Although the geospatial relationship is significant for geographic similarity measure, it has been ignored by most of the previous works. This article introduces the geospatial relationship and topology, and proposes an approach to compute the geospatial record similarity based on multiple features including the geospatial relationships, category and name tags. In order to improve the flexibility and operability, supervised machine learning such as SVM is used for the task of classifying pairs of mapping records. The authors test their approach using three sources, namely, OpenStreetMap, Google and Wikimapia. The results showed that the proposed approach obtained high correlation with the human judgements.
Reference34 articles.
1. Geographic knowledge extraction and semantic similarity in OpenStreetMap
2. Linking geographic vocabularies through WordNet
3. Baxter, R., Christen, P., & Churches, T. (2003). A Comparison of Fast Blocking Methods for Record Linkage. In Proc. ACM Workshop Data Cleaning, Record Linkage and Object ConsolidationSIGKDD ’03 (pp. 25-27).
4. Bellahsene, Z., Bonifati, A., Duchateau, F., & Velegrakis, Y. (2011). On evaluating schema matching and mapping. In Schema matching and mapping (pp. 253-291).
5. A geospatial search engine for discovering multi-format geospatial data across the web