Affiliation:
1. Technical University of Munich
Abstract
Current hardware development greatly influences the design decisions of modern database systems. For many modern performance-focused database systems, query compilation emerged as an integral part and different approaches for code generation evolved, making use of standard compilers, general-purpose compiler libraries, or domain-specific code generators. However, development primarily focused on the dominating x86-64 server architecture; but neglected current hardware developments towards other CPU architectures like ARM and other RISC architectures.
Therefore, we explore the design space of code generation in database systems considering a variety of state-of-the-art compilation approaches with a set of qualitative and quantitative metrics. Based on our findings, we have developed a new code generator called FireARM for AArch64-based systems in our database system, Umbra. We identify general as well as architecture-specific challenges for custom code generation in databases and provide potential solutions to abstract or handle them.
Furthermore, we present an extensive evaluation of different compilation approaches in Umbra on a wide variety of x86-64 and ARM machines. In particular, we compare quantitative performance characteristics such as compilation latency and query throughput.
Our results show that using standard languages and compiler infrastructures reduces the barrier to employing query compilation and allows for high performance on big data sets, while domain-specific code generators can achieve a significantly lower compilation overhead and allow for better targeting of new architectures.
Publisher
Association for Computing Machinery (ACM)
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Reference41 articles.
1. Inc Amazon Web Services . 2022 . Factors affecting query performance. https://docs.aws.amazon.com/redshift/latest/dg/c-query-performance.html . Accessed : February 26, 2023. Inc Amazon Web Services. 2022. Factors affecting query performance. https://docs.aws.amazon.com/redshift/latest/dg/c-query-performance.html. Accessed: February 26, 2023.
2. System R
3. Brian Babcock Shivnath Babu Mayur Datar Rajeev Motwani and Jennifer Widom. 2002. Models and Issues in Data Stream Systems. In PODS. ACM 1--16. Brian Babcock Shivnath Babu Mayur Datar Rajeev Motwani and Jennifer Widom. 2002. Models and Issues in Data Stream Systems. In PODS. ACM 1--16.
4. A history and evaluation of System R
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Query Compilation Without Regrets;Proceedings of the ACM on Management of Data;2024-05-29
2. Incremental Fusion: Unifying Compiled and Vectorized Query Execution;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13
3. Compile-Time Analysis of Compiler Frameworks for Query Compilation;2024 IEEE/ACM International Symposium on Code Generation and Optimization (CGO);2024-03-02
4. Analyzing Vectorized Hash Tables across CPU Architectures;Proceedings of the VLDB Endowment;2023-07