RDFAdaptor: Efficient ETL Plugins for RDF Data Process
Author:
Li Jiao1, Xian Guojian12, Zhao Ruixue12, Huang Yongwen1, Kou Yuantao12, Luo Tingting1, Sun Tan23
Affiliation:
1. Agricultural Information Institute of CAAS , Beijing , China 2. Key Laboratory of Agricultural Big Data , Ministry of Agriculture and Rural Affairs , Beijing , China 3. Chinese Academy of Agricultural Sciences , Beijing , China
Abstract
Abstract
Purpose
The interdisciplinary nature and rapid development of the Semantic Web led to the mass publication of RDF data in a large number of widely accepted serialization formats, thus developing out the necessity for RDF data processing with specific purposes. The paper reports on an assessment of chief RDF data endpoint challenges and introduces the RDF Adaptor, a set of plugins for RDF data processing which covers the whole life-cycle with high efficiency.
Design/methodology/approach
The RDFAdaptor is designed based on the prominent ETL tool—Pentaho Data Integration—which provides a user-friendly and intuitive interface and allows connect to various data sources and formats, and reuses the Java framework RDF4J as middleware that realizes access to data repositories, SPARQL endpoints and all leading RDF database solutions with SPARQL 1.1 support. It can support effortless services with various configuration templates in multi-scenario applications, and help extend data process tasks in other services or tools to complement missing functions.
Findings
The proposed comprehensive RDF ETL solution—RDFAdaptor—provides an easy-to-use and intuitive interface, supports data integration and federation over multi-source heterogeneous repositories or endpoints, as well as manage linked data in hybrid storage mode.
Research limitations
The plugin set can support several application scenarios of RDF data process, but error detection/check and interaction with other graph repositories remain to be improved.
Practical implications
The plugin set can provide user interface and configuration templates which enable its usability in various applications of RDF data generation, multi-format data conversion, remote RDF data migration, and RDF graph update in semantic query process.
Originality/value
This is the first attempt to develop components instead of systems that can include extract, consolidate, and store RDF data on the basis of an ecologically mature data warehousing environment.
Publisher
Walter de Gruyter GmbH
Reference21 articles.
1. Arenas, M., Bertails, A., Prud'hommeaux, E., & Sequeda, J. (2012). A direct mapping of relational data to RDF. Retrieved from https://www.w3.org/TR/2012/REC-rdb-direct-mapping-20120927/ 2. Bishop, B., Kiryakov, A., Ognyanoff, D., Peikov, I., Tashev, Z., & Ruslan, V.R. (2011). OWLIM: A family of scalable semantic repositories. Semantic Web, 2(1), 33–42. doi: 10.3233/SW-2011-0026 3. Bizer, C., Heath, T., & Berners-Lee, T. (2009). Linked data—the story so far. International Journal on Semantic Web and Information Systems, 5(3), 1–22. doi: 10.4018/jswis.2009081901 4. Broekstra, J., Kampman, A., & Harmelen, F. (2002). Sesame: A generic architecture for storing and querying RDF and RDF schema. In Proceedings of the Semantic Web-ISWC 2002, 54–68. Berlin, Heidelberg: Springer. doi: 10.1007/3-540-48005-6_7 5. Cyganiak, R., Wood, D., & Lanthaler, M. (2014). RDF 1.1 Concepts and Abstract Syntax. W3C Recommendation. Retrieved from https://www.w3.org/TR/rdf11-concepts/
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Exploring the use of gender-fair language by influencers;European Journal of Management and Business Economics;2023-07-11 2. Scientific Knowledge Graph-driven Research Profiling;The 6th International Conference on Computer Science and Application Engineering;2022-10-21
|
|