Abstract
Visual data exploration tools allow users to quickly gather insights from new datasets. As dataset sizes continue to increase, though, new techniques will be necessary to maintain the interactivity guarantees that these tools require. Approximate query processing (AQP) attempts to tackle this problem and allows systems to return query results at "human speed." However, existing AQP techniques start to break down when confronted with ad hoc queries that target the tails of the distribution.
We therefore present an AQP formulation that can provide low-error approximate results at interactive speeds, even for queries over rare subpopulations. In particular, our formulation treats query results as random variables in order to leverage the ample opportunities for result reuse inherent in interactive data exploration. As part of our approach, we apply a variety of optimization techniques that are based on probability theory, including new query rewrite rules and index structures. We implemented these techniques in a prototype system and show that they can achieve interactivity where alternative approaches cannot.
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Cited by
48 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Learned Optimizer for Online Approximate Query Processing in Data Exploration;IEEE Transactions on Knowledge and Data Engineering;2024-08
2. Optimizing Dataflow Systems for Scalable Interactive Visualization;Proceedings of the ACM on Management of Data;2024-03-12
3. Approximate Query Processing Based on Approximate Materialized View;Lecture Notes in Computer Science;2024
4. Tuple Bubbles: Learned Tuple Representations for Tunable Approximate Query Processing;Proceedings of the Sixth International Workshop on Exploiting Artificial Intelligence Techniques for Data Management;2023-06-18
5. S/C: Speeding up Data Materialization with Bounded Memory;2023 IEEE 39th International Conference on Data Engineering (ICDE);2023-04