Affiliation:
1. ETH Zurich, Zurich, Switzerland
Abstract
Online-Analytical Processing (OLAP) has been a field of competing technologies for the past ten years. One of the still unsolved challenges of OLAP is how to provide quick response times on
any
Terabyte-sized business data problem. Recently, a very clever multi-dimensional index structure termed Dwarf [26] has been proposed offering excellent query response times as well as unmatched index compression rates. The proposed index seems to scale well for both large data sets as well as high dimensions. Motivated by these surprisingly excellent results, we take a look into the rearview mirror. We have re-implemented the Dwarf index from scratch and make three contributions. First, we successfully repeat several of the experiments of the original paper. Second, we substantially correct some of the experimental results reported by the inventors. Some of our results differ by orders of magnitude. To better understand these differences, we provide additional experiments that better explain the behavior of the Dwarf index. Third, we provide
missing
experiments comparing Dwarf to baseline query processing strategies. This should give practitioners a better guideline to understand for which cases Dwarf indexes could be useful in practice.
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. The Big Data Analytics Group at Saarland University;Datenbank-Spektrum;2022-10-07
2. Efficient Evaluation of Multi-Column Selection Predicates in Main-Memory;IEEE Transactions on Knowledge and Data Engineering;2019-07-01
3. Eunomia: Scaling Concurrent Index Structures Under Contention Using HTM;IEEE Transactions on Parallel and Distributed Systems;2018-08-01
4. Accelerating Multi-Column Selection Predicates in Main-Memory - The Elf Approach;2017 IEEE 33rd International Conference on Data Engineering (ICDE);2017-04
5. Eunomia;Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming;2017-01-26