Affiliation:
1. University of California, Davis
2. Harvard University
Abstract
We demonstrate an efficient data-parallel algorithm for building large hash tables of millions of elements in real-time. We consider two parallel algorithms for the construction: a classical sparse perfect hashing approach, and cuckoo hashing, which packs elements densely by allowing an element to be stored in one of multiple possible locations. Our construction is a hybrid approach that uses both algorithms. We measure the construction time, access time, and memory usage of our implementations and demonstrate real-time performance on large datasets: for 5 million key-value pairs, we construct a hash table in 35.7 ms using 1.42 times as much memory as the input data itself, and we can access all the elements in that hash table in 15.3 ms. For comparison, sorting the same data requires 36.6 ms, but accessing all the elements via binary search requires 79.5 ms. Furthermore, we show how our hashing methods can be applied to two graphics applications: 3D surface intersection for moving data and geometric hashing for image matching.
Funder
National Science Foundation
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Graphics and Computer-Aided Design
Cited by
48 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Efficient Parallel Processing of R-Tree on GPUs;Mathematics;2024-07-05
2. Minuet: Accelerating 3D Sparse Convolutions on GPUs;Proceedings of the Nineteenth European Conference on Computer Systems;2024-04-22
3. Computing Group-By and Aggregates on Massively Parallel Systems;2023 IEEE International Conference on Big Data (BigData);2023-12-15
4. RTIndeX: Exploiting Hardware-Accelerated GPU Raytracing for Database Indexing;Proceedings of the VLDB Endowment;2023-09
5. Efficient GPU-Accelerated Subgraph Matching;Proceedings of the ACM on Management of Data;2023-06-13