An experimental study of group-by and aggregation on CPU-GPU processors-Reference-Cited by-同舟云学术

An experimental study of group-by and aggregation on CPU-GPU processors

Published:2022-06-22 Issue:1 Volume:69 Page:
ISSN:1110-1903
Container-title:Journal of Engineering and Applied Science
language:en
Short-container-title:J. Eng. Appl. Sci.

Author:

Luan Hua,Chang Lei

Abstract

AbstractHash-based group-by and aggregation is a fundamental operator in database systems. Modern discrete GPUs (graphics processing units) have been considered to accelerate the performance. However, the data transfer through the PCIe (peripheral component interconnect express) bus would reduce gains. On recent architectures, the GPU and the CPU (central processing unit) are built into the same chip which removes the data transmission and offers new performance opportunities. Yet there has been no systematic analysis of grouping and aggregation algorithms on such architectures. In this paper, we study the behaviors of various hash-based grouping and aggregation methods on coupled architectures to provide meaningful guidelines. We conduct an extensive experimental study and analysis on the single CPU, the coupled GPU, and both processors. Six dimensions are considered in analyzing the hashing methods carefully: (1) hashing scheme, (2) hash function, (3) data size, (4) group cardinality, (5) load factor, and (6) data distribution. Two additional dimensions are also explored: (7) shared and independent hash tables and (8) running on single processors and co-processing. We hope the results in our study could help database researchers to choose the right direction in terms of algorithm design and system optimization.

Funder

National Key Research and Development Program of China

Grant from the Capital Science and Technology Innovation Vouchers of China

Publisher

Springer Science and Business Media LLC

Subject

General Engineering

Link

https://link.springer.com/content/pdf/10.1186/s44147-022-00108-1.pdf

Reference36 articles.

1. Cieslewicz J, Ross KA (2007) Adaptive aggregation on chip multiprocessors In: Proceedings of the 33rd International Conference on Very Large Data Bases, University of Vienna, Austria, 23-27 September 2007, 339–350.. ACM, New York.

2. Müller I, Sanders P, Lacurie A, Lehner W, Färber F (2015) Cache-efficient aggregation: Hashing is sorting In: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31 - June 04 2015, 1123–1136.. ACM, New York.

3. Ye Y, Ross KA, Vesdapunt N (2011) Scalable aggregation on multicore processors In: Proceedings of the Seventh International Workshop on Data Management on New Hardware, DaMoN 2011, Athens, Greece, 13 June 2011, 1–9.. ACM, New York.

4. Power J, Li Y, Hill DM, Patel MJ, Wood AD (2015) Toward GPUs being mainstream in analytic processing In: Proceedings of the 11th International Workshop on Data Management on New Hardware, DaMoN 2015, Melbourne, Victoria, Australia, May 31 - June 04 2015, 11:1–11:8.. ACM, New York.

5. Karnagel T, Müller R, Lohman MG (2015) Optimizing GPU-accelerated group-by and aggregation In: International Workshop on Accelerating Data Management Systems Using Modern Processor and Storage Architectures - ADMS 2015, Kohala Coast, Hawaii, USA, 31 August 2015, 13–24.