An evaluation of uplift mapping languages

Author:

Crotti Junior Ademar, Debruyne Christophe, Brennan Rob, O’Sullivan Declan

Abstract

Purpose: This paper aims to evaluate the state of the art in CSV uplift tools. Based on this evaluation, a method that incorporates data transformations into uplift mapping languages by means of functions is proposed and evaluated. Typically, tools that map non-RDF data into the resource description framework (RDF) format rely on the technology native to the data source when data transformation is required. Depending on the data format, data manipulation can be performed using the underlying technology, such as a relational database management system (RDBMS) for relational databases or XPath for XML. For CSV/tabular data there is no such underlying technology; transformation instead requires either converting the source data into another format or applying pre- or post-processing techniques.

Design/methodology/approach: To evaluate the state of the art in CSV uplift tools, the authors present a comparison framework and apply it to such tools. A key feature evaluated in the comparison framework is support for data transformation functions. The authors argue that existing approaches to transformation functions are complex, in that several steps and tools are required. The proposed method, FunUL, in contrast, defines functions independently of the source data being mapped into RDF, as resources within the mapping itself.

Findings: The approach was evaluated using two typical real-world use cases. The authors compared how well their approach and others (those that include transformation functions as part of the uplift mapping) could implement an uplift mapping from CSV/tabular data into RDF. This comparison indicates that the authors’ approach performs well for these use cases.

Originality/value: This paper presents a comparison framework and applies it to the state of the art in CSV uplift tools. Furthermore, the authors describe FunUL, which, unlike other related work, defines functions as resources within the uplift mapping itself, integrating data transformation functions and mapping definitions. This makes the generation of RDF from source data transparent and traceable. Moreover, because functions are defined as resources, they can be reused multiple times within mappings.
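The idea of keeping transformation functions inside the mapping, as named and reusable resources, can be illustrated with a small Python sketch. This is not FunUL syntax and not the authors' implementation; the function names (`ex:toUpper`, `ex:milesToKm`), the mapping layout, the `uplift` helper and the sample data are all assumptions made for illustration only.

```python
import csv
import io

# Hypothetical function registry: each transformation is defined once,
# under a name, independently of any particular CSV source.
FUNCTIONS = {
    "ex:toUpper": lambda value: value.strip().upper(),
    "ex:milesToKm": lambda value: f"{float(value) * 1.609344:.2f}",
}

# Hypothetical mapping: each rule names a predicate, a source column and,
# optionally, a function from the registry. "ex:toUpper" is reused twice.
MAPPING = {
    "subject_template": "http://example.org/city/{id}",
    "rules": [
        {"predicate": "http://example.org/name", "column": "name",
         "function": "ex:toUpper"},
        {"predicate": "http://example.org/code", "column": "code",
         "function": "ex:toUpper"},
        {"predicate": "http://example.org/distanceKm", "column": "miles",
         "function": "ex:milesToKm"},
    ],
}


def uplift(csv_text, mapping, functions):
    """Turn CSV rows into (subject, predicate, object) tuples.

    The transformation is applied during the mapping step itself, so no
    separate pre- or post-processing pass over the data is needed.
    """
    triples = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        subject = mapping["subject_template"].format(**row)
        for rule in mapping["rules"]:
            value = row[rule["column"]]
            function_name = rule.get("function")
            if function_name is not None:
                value = functions[function_name](value)
            triples.append((subject, rule["predicate"], value))
    return triples


# Made-up sample data, for illustration only.
SAMPLE = "id,name,code,miles\n1,dublin,dub,10\n"

if __name__ == "__main__":
    for triple in uplift(SAMPLE, MAPPING, FUNCTIONS):
        print(triple)
```

Because each rule refers to a function by name, a reader of the mapping can trace which transformation produced each value, and the same function can serve several rules without being redefined.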

Publisher

Emerald

Subject

Computer Networks and Communications, Information Systems


