ReJOOSp: Reinforcement Learning for Join Order Optimization in SPARQL

Published: 2024-06-27 Issue: 7 Volume: 8 Page: 71
ISSN: 2504-2289
Container-title: Big Data and Cognitive Computing
Language: en
Short-container-title: BDCC

Author:

Warnke Benjamin¹, Martens Kevin¹, Winker Tobias¹, Groppe Sven¹, Groppe Jinghua¹, Adhiyaman Prasad², Srinivasan Sruthi², Krishnakumar Shridevi²

Affiliation:

1. Institute of Information Systems, University of Lübeck, 23562 Lübeck, Germany

2. Centre for Advanced Data Science, Vellore Institute of Technology, Chennai 600127, India

Abstract

The choice of a good join order plays an important role in the query performance of databases. However, determining the best join order is known to be an NP-hard problem, with a search space that grows exponentially with the number of joins. Because of this, non-learning approaches to join order optimization incur longer optimization and execution times. In comparison, machine learning models, once trained, can construct optimized query plans very quickly. Several efforts have applied machine learning to join order optimization for SQL queries, outperforming traditional approaches. In this work, we propose ReJOOSp, a reinforcement learning technique for join order optimization of SPARQL queries. SPARQL queries typically contain a much higher number of joins than SQL queries and so are more difficult to optimize. To evaluate ReJOOSp, we further develop a join order optimizer based on ReJOOSp and integrate it into the Semantic Web DBMS Luposdate3000. The evaluation shows that ReJOOSp can significantly enhance query performance by achieving high-quality execution plans for a substantial portion of queries across synthetic and real-world datasets.
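To give a rough sense of the growth the abstract refers to, the sketch below counts candidate join trees for n inputs. It is an illustration only and is not taken from the paper: it assumes cross products are allowed (the query graph is ignored) and that swapping the left and right input of a join yields a different plan. The function names are hypothetical.

```python
from math import comb, factorial


def left_deep_orders(n: int) -> int:
    """Left-deep join trees over n inputs: one per permutation, i.e. n!."""
    return factorial(n)


def bushy_orders(n: int) -> int:
    """Bushy join trees over n inputs: n! * Catalan(n - 1).

    Catalan(n - 1) counts binary tree shapes with n leaves;
    n! counts the ways to assign the inputs to those leaves.
    """
    catalan = comb(2 * (n - 1), n - 1) // n
    return factorial(n) * catalan


if __name__ == "__main__":
    # Print how quickly the plan space grows with the number of inputs.
    for n in (2, 4, 8, 16, 32):
        print(f"n={n:2d}  left-deep={left_deep_orders(n):.3e}  "
              f"bushy={bushy_orders(n):.3e}")
```

Under these assumptions, four inputs already give 24 left-deep and 120 bushy trees, which is why exhaustive enumeration becomes impractical for queries with many joins.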

Funder

Deutsche Forschungsgemeinschaft

German Federal Ministry of Education and Research within the funding program quantum technologies

Publisher

MDPI AG

Link

https://www.mdpi.com/2504-2289/8/7/71/pdf
