Affiliation:
1. Aarhus University, Denmark
2. University of Ioannina, Greece
Abstract
The interval join is a basic operation that finds application in temporal, spatial, and uncertain databases. Although a number of centralized and distributed algorithms have been proposed for the efficient evaluation of interval joins, classic
plane sweep
approaches have not been considered at their full potential. A recent piece of related work proposes an optimized approach based on plane sweep (PS) for modern hardware, showing that it greatly outperforms previous work. However, this approach depends on the development of a complex data structure and its parallelization has not been adequately studied. In this paper, we explore the applicability of a largely ignored
forward scan
(FS) based plane sweep algorithm, which is extremely simple to implement. We propose two optimizations of FS that greatly reduce its cost, making it competitive to the state-of-the-art single-threaded PS algorithm while achieving a lower memory footprint. In addition, we show the drawbacks of a previously proposed hash-based partitioning approach for parallel join processing and suggest a domain-based partitioning approach that does not produce duplicate results. Within our approach we propose a novel breakdown of the partition join jobs into a small number of independent mini-join jobs with varying cost and manage to avoid redundant comparisons. Finally, we show how these mini-joins can be scheduled in multiple CPU cores and propose an adaptive domain partitioning, aiming at load balancing. We include an experimental study that demonstrates the efficiency of our optimized FS and the scalability of our parallelization framework.
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Cited by
26 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Temporal subgraph matching method for multi-connected temporal graph;Information Sciences;2025-01
2. Independent Range Sampling on Interval Data;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13
3. LIT: Lightning-fast In-memory Temporal Indexing;Proceedings of the ACM on Management of Data;2024-03-12
4. A New Primitive for Processing Temporal Joins;Proceedings of the 18th International Symposium on Spatial and Temporal Data;2023-08-23
5. Indexing Temporal Relations for Range-Duration Queries;35th International Conference on Scientific and Statistical Database Management;2023-07-10