Affiliation:
1. Key Laboratory of High Confidence Software Technologies, CS, Peking University, China
2. ZTE Corporation, China
Abstract
Query optimization based on deep reinforcement learning (DRL) has become a hot research topic recently. Despite the achieved promising progress, DRL optimizers still face great challenges of robustly producing efficient plans, due to the vast search space for both join order and operator selection and the highly varying execution latency taken as the feedback signal. In this paper, we propose LOGER, a learned optimizer towards generating efficient and robust plans, aiming at producing both efficient join orders and operators. LOGER first utilizes Graph Transformer to capture relationships between tables and predicates. Then, the search space is reorganized, in which LOGER learns to restrict specific operators instead of directly selecting one for each join, while utilizing DBMS built-in optimizer to select physical operators under the restrictions. Such a strategy exploits expert knowledge to improve the robustness of plan generation while offering sufficient plan search flexibility. Furthermore, LOGER introduces
ε
-beam search, which keeps multiple search paths that preserve promising plans while performing guided exploration. Finally, LOGER introduces a loss function with reward weighting to further enhance performance robustness by reducing the fluctuation caused by poor operators, and log transformation to compress the range of rewards. We conduct experiments on Join Order Benchmark (JOB), TPC-DS and Stack Overflow, and demonstrate that LOGER can achieve a performance better than existing learned query optimizers, with a 2.07x speedup on JOB compared with PostgreSQL.
Publisher
Association for Computing Machinery (ACM)
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Reference29 articles.
1. Using confidence bounds for exploitation-exploration trade-offs;Auer Peter;Journal of Machine Learning Research 3,2002
2. Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks
3. Nan Ding and Radu Soricut . 2017. Cold-start reinforcement learning with softmax policy gradient. Advances in Neural Information Processing Systems 30 ( 2017 ). Nan Ding and Radu Soricut. 2017. Cold-start reinforcement learning with softmax policy gradient. Advances in Neural Information Processing Systems 30 (2017).
4. Vijay Prakash Dwivedi and Xavier Bresson . 2020. A generalization of transformer networks to graphs. arXiv preprint arXiv:2012.09699 ( 2020 ). Vijay Prakash Dwivedi and Xavier Bresson. 2020. A generalization of transformer networks to graphs. arXiv preprint arXiv:2012.09699 (2020).
Cited by
14 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Low Rank Approximation for Learned Query Optimization;Proceedings of the Seventh International Workshop on Exploiting Artificial Intelligence Techniques for Data Management;2024-06-09
2. Learned Query Optimizer: What is New and What is Next;Companion of the 2024 International Conference on Management of Data;2024-06-09
3. Constrained Quadratic Model for Optimizing Join Orders;Workshop on Quantum Computing and Quantum-Inspired Technology for Data-Intensive Systems and Applications;2024-06-09
4. Robust Query Optimization in the Era of Machine Learning: State-of-the-Art and Future Directions;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13
5. GLO: Towards Generalized Learned Query Optimization;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13