SPARQL2Flink: Evaluation of SPARQL Queries on Apache Flink

Published: 2021-07-30 Issue: 15 Volume: 11 Page: 7033
ISSN: 2076-3417
Container-title: Applied Sciences
Language: en
Short-container-title: Applied Sciences

Author:

Ceballos Oscar, Ramírez Restrepo Carlos Alberto, Pabón María Constanza, Castillo Andres M., Corcho Oscar

Abstract

Existing SPARQL query engines and triple stores are continuously improved to handle increasingly large datasets. In this context, several approaches have been developed that propose storing and querying RDF data in a distributed fashion, mainly using the MapReduce programming model and Hadoop-based ecosystems. New trends in Big Data technologies have also emerged (e.g., Apache Spark, Apache Flink); they use distributed in-memory processing and promise to deliver higher data processing performance. In this paper, we present a formal interpretation of some PACT transformations implemented in the Apache Flink DataSet API. We use this formalization to provide a mapping that translates a SPARQL query into a Flink program. The mapping was implemented in a prototype used to determine the correctness and performance of the solution. The source code of the project is available on GitHub under the MIT license.

Funder

Departamento Administrativo de Ciencia, Tecnología e Innovación

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/11/15/7033/pdf

References: 64 articles.

1. Resource Description Framework (RDF): Concepts and Abstract Syntax. https://www.w3.org/TR/rdf-concepts/

2. SPIDER

3. RDF in the clouds: a survey

4. Processing SPARQL queries over distributed RDF graphs

5. Query Processing over Large RDF using SPARQL in Big Data

Cited by 3 articles.

1. A real-time approach for smart building operations prediction using rule-based complex event processing and SPARQL query; The Journal of Supercomputing; 2024-06-12

2. Machine Learning and Deep Learning for Big Data Analysis; Advances in Business Information Systems and Analytics; 2024-01-04

3. Efficient query evaluation techniques over large amount of distributed linked data; Information Systems; 2023-05