Affiliation:
1. Hong Kong University of Science and Technology
Abstract
One-dimensional range queries, as one of the most basic type of queries in databases, have been studied extensively in the literature. For large databases, the goal is to build an external index that is optimized for disk block accesses (or I/Os). The problem is well understood in the static case. Theoretically, there exists an index of linear size that can answer a range query in O(1 +
KB
) I/Os, where
K
is the output size and
B
is the disk block size, but it is highly impractical. In practice, the standard solution is the B-tree, which answers a query in
O
(log
B
NM
+
KB
) I/Os on a data set of size
N
, where
M
is the main memory size. For typical values of
N, M
, and
B
, log
B
NM
can be considered a constant.
However, the problem is still wide open in the dynamic setting, when insertions and deletions of records are to be supported. With smart buffering, it is possible to speed up updates significantly to
o
(1) I/Os amortized. Indeed, several dynamic B-trees have been proposed, but they all cause certain levels of degradation in the query performance, with the most interesting tradeoff point at
O
(1
B
log
NM
) I/Os for updates and
O
(log
NM
+
KB
) I/Os for queries. In this article, we prove that the query-update tradeoffs of all the known dynamic B-trees are optimal, when log
B
NM
is a constant. This implies that one should not hope for substantially better solutions for all practical values of the parameters. Our lower bounds hold in a dynamic version of the
indexability model
, which is of independent interests. Dynamic indexability is a clean yet powerful model for studying dynamic indexing problems, and can potentially lead to more interesting lower bound results.
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence,Hardware and Architecture,Information Systems,Control and Systems Engineering,Software
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Competitive Data-Structure Dynamization;ACM Transactions on Algorithms;2024-06-28
2. Data Structures for Data-Intensive Applications: Tradeoffs and Design Guidelines;Foundations and Trends® in Databases;2023
3. I/O-efficient 2-d orthogonal range skyline and attrition priority queues;Computational Geometry;2021-02
4. Fully persistent B-trees;Theoretical Computer Science;2020-11
5. Write-Optimized Skip Lists;Proceedings of the 36th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems;2017-05-09