Affiliation:
1. University of Illinois at Urbana-Champaign, Urbana, IL
Abstract
The unique characteristics of scientific data and queries cause traditional indexing techniques to perform poorly on scientific workloads, occupy excessive space, or both. Refinements of bitmap indexes have been proposed previously as a solution to this problem. In this article, we describe the difficulties we encountered in deploying bitmap indexes with scientific data and queries from two real-world domains. In particular, previously proposed methods of binning, encoding, and compressing bitmap vectors either were quite slow for processing the large-range query conditions our scientists used, or required excessive storage space. Nor could the indexes easily be built or used on parallel platforms. In this article, we show how to solve these problems through the use of
multi-resolution, parallelizable
bitmap indexes, which support a fine-grained trade-off between storage requirements and query performance. Our experiments with large data sets from two scientific domains show that multi-resolution, parallelizable bitmap indexes occupy an acceptable amount of storage while improving range query performance by roughly a factor of 10, compared to a single-resolution bitmap index of reasonable size.
Publisher
Association for Computing Machinery (ACM)
Cited by
59 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Hierarchical Bitmap Indexing for Range Queries on Multidimensional Arrays;Database Systems for Advanced Applications;2022
2. Multivariate Probabilistic Range Queries for Scalable Interactive 3D Visualization;IEEE Transactions on Visualization and Computer Graphics;2022
3. Adaptive Spatially Aware I/O for Multiresolution Particle Data Layouts;2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS);2021-05
4. MoHA: A Composable System for Efficient In-Situ Analytics on Heterogeneous HPC Systems;SC20: International Conference for High Performance Computing, Networking, Storage and Analysis;2020-11
5. Optimizing bitmap index encoding for high performance queries;Concurrency and Computation: Practice and Experience;2020-09-07