Affiliation:
1. The Ohio State University, USA
Abstract
The commoditization of high-performance networking has sparked research interest in the RDMA capability of this hardware. One-sided RDMA primitives, in particular, have generated substantial excitement due to the ability to directly access remote memory from within an application without involving the TCP/IP stack or the remote CPU. This article considers how to leverage RDMA to improve the analytical performance of parallel database systems. To shuffle data efficiently using RDMA, one needs to consider a complex design space that includes (1) the number of open connections, (2) the contention for the shared network interface, (3) the RDMA transport function, and (4) how much memory should be reserved to exchange data between nodes during query processing. We contribute eight designs that capture salient tradeoffs in this design space as well as an adaptive algorithm to dynamically manage RDMA-registered memory. We comprehensively evaluate how transport-layer decisions impact the query performance of a database system for different generations of InfiniBand. We find that a shuffling operator that uses the RDMA Send/Receive transport function over the Unreliable Datagram transport service can transmit data up to 4× faster than an RDMA-capable MPI implementation in a 16-node cluster. The response time of TPC-H queries improves by as much as 2×.
Funder
National Science Foundation
Google Research Faculty Award
Publisher
Association for Computing Machinery (ACM)
Reference60 articles.
1. Accelio. 2019. Accelio. Retrieved from http://www.accelio.org/. Accelio. 2019. Accelio. Retrieved from http://www.accelio.org/.
2. Attack of the killer microseconds
3. Distributed join algorithms on thousands of cores
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Agent-Based Parallelization of a Multi-Dimensional Semantic Database Model;2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI);2023-08
2. Towards Accelerating Data Intensive Application's Shuffle Process Using SmartNICs;Proceedings of the ACM on Measurement and Analysis of Computing Systems;2023-05-19
3. A Survey of Storage Systems in the RDMA Era;IEEE Transactions on Parallel and Distributed Systems;2022-12-01
4. High-Performance Big Data Computing;2022-08-02
5. Redy;Proceedings of the VLDB Endowment;2021-12