qEndpoint: A novel triple store architecture for large RDF graphs-Reference-Cited by-同舟云学术

qEndpoint: A novel triple store architecture for large RDF graphs

Published:2024-08-20 Issue: Volume: Page:1-19
ISSN:2210-4968
Container-title:Semantic Web
language:
Short-container-title:SW

Author:

Willerval Antoine¹²,Diefenbach Dennis¹,Bonifati Angela²

Affiliation:

1. The QA Company, France

2. CNRS Liris, Lyon 1 University, IUF, France

Abstract

In the relational database realm, there has been a shift towards novel hybrid database architectures combining the properties of transaction processing (OLTP) and analytical processing (OLAP). OLTP workloads are made up by read and write operations on a small number of rows and are typically addressed by indexes such as B+trees. On the other side, OLAP workloads consists of big read operations that scan larger parts of the dataset. To address both workloads some databases introduced an architecture using a buffer or delta partition. Precisely, changes are accumulated in a write-optimized delta partition while the rest of the data is compressed in the read-optimized main partition. Periodically, the delta storage is merged in the main partition. In this paper we investigate for the first time how this architecture can be implemented and behaves for RDF graphs. We describe in detail the indexing-structures one can use for each partition, the merge process as well as the transactional management. We study the performances of our triple store, which we call qEndpoint, over two popular benchmarks, the Berlin SPARQL Benchmark (BSBM) and the recent Wikidata Benchmark (WDBench). We are also studying how it compares against other public Wikidata endpoints. This allows us to study the behavior of the triple store for different workloads, as well as the scalability over large RDF graphs. The results show that, compared to the baselines, our triple store allows for improved indexing times, better response time for some queries, higher insert and delete rates, and low disk and memory footprints, making it ideal to store and serve large Knowledge Graphs.

Publisher

IOS Press

Reference24 articles.

1. The design and implementation of modern column-oriented database systems;Abadi;Found. Trends Databases,2013

2. WDBench: A Wikidata Graph Query Benchmark

3. QLever

4. The Berlin sparql benchmark;Bizer;International Journal on Semantic Web and Information Systems (IJSWIS),2009

5. D. Diefenbach and J.M. Giménez-García, HDTCat: Let’s make HDT generation scale, in: International Semantic Web Conference, Springer, 2020, pp. 18–33.