Affiliation:
1. ETH Zurich, Switzerland
2. TU Dortmund University, Germany
3. University of Waterloo, Canada
Abstract
In this paper we experimentally study the performance of main-memory, parallel, multi-core join algorithms, focusing on sort-merge and (radix-)hash join. The relative performance of these two join approaches have been a topic of discussion for a long time. With the advent of modern multi-core architectures, it has been argued that sort-merge join is now a better choice than radix-hash join. This claim is justified based on the width of SIMD instructions (sort-merge outperforms radix-hash join once SIMD is sufficiently wide), and NUMA awareness (sort-merge is superior to hash join in NUMA architectures). We conduct extensive experiments on the original and optimized versions of these algorithms. The experiments show that, contrary to these claims, radix-hash join is still clearly superior, and sort-merge approaches to performance of radix only when very large amounts of data are involved. The paper also provides the fastest implementations of these algorithms, and covers many aspects of modern hardware architectures relevant not only for joins but for any parallel data processing operator.
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Cited by
160 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Optimization of Sparse Matrix Computation for Algebraic Multigrid on GPUs;ACM Transactions on Architecture and Code Optimization;2024-09-14
2. SIMDified Data Processing - Foundations, Abstraction, and Advanced Techniques;Companion of the 2024 International Conference on Management of Data;2024-06-09
3. How Does Software Prefetching Work on GPU Query Processing?;Proceedings of the 20th International Workshop on Data Management on New Hardware;2024-06-09
4. CPU and GPU Hash Joins on Skewed Data;2024 IEEE 40th International Conference on Data Engineering Workshops (ICDEW);2024-05-13
5. Data-centric workloads with MPI_Sort;Journal of Parallel and Distributed Computing;2024-05