BP-Tree: Overcoming the Point-Range Operation Tradeoff for In-Memory B-Trees

Author:

Xu Helen1,Li Amanda2,Wheatman Brian3,Marneni Manoj4,Pandey Prashant4

Affiliation:

1. Lawrence Berkeley, National Laboratory

2. Massachusetts Institute of Technology

3. Johns, Hopkins University

4. University of Utah

Abstract

B-trees are the go-to data structure for in-memory indexes in databases and storage systems. B-trees support both point operations (i.e., inserts and finds) and range operations (i.e., iterators and maps). However, there is an inherent tradeoff between point and range operations since the optimal node size for point operations is much smaller than the optimal node size for range operations. Existing implementations use a relatively small node size to achieve fast point operations at the cost of range operation throughput. We present the BP-tree , a variant of the B-tree, that overcomes the decades-old point-range operation tradeoff in traditional B-trees. In the BP-tree, the leaf nodes are much larger in size than the internal nodes to support faster range scans. To avoid any slowdown in point operations due to large leaf nodes, we introduce a new insert-optimized array called the buffered partitioned array (BPA) to efficiently organize data in leaf nodes. The BPA supports fast insertions by delaying ordering the keys in the array. This results in much faster range operations and faster point operations at the same time in the BP-tree. Our experiments show that on 48 hyperthreads, on workloads generated from the Yahoo! Cloud Serving Benchmark (YCSB), the BP-tree supports similar or faster point operation throughput (between .94×-1.2× faster) compared to Masstree and OpenBw-tree, two state-of-the-art in-memory key-value (KV) stores. On a YCSB workload with short scans, the BP-tree is about 7.4× faster than Masstree and 1.6× faster than OpenBw-tree. Furthermore, we extend the YCSB to add large range workloads, commonly found in database applications, and show that the BP-tree is 30× faster than Masstree and 2.5× faster than OpenBw-tree. We also provide a reference implementation for a concurrent B + -tree and find that the BP-tree supports faster (between 1.03×-1.2× faster) point operations when compared to the best-case configuration for B + -trees for point operations while supporting similar performance (about .95× as fast) on short range operations and faster (about 1.3× faster) long range operations.

Publisher

Association for Computing Machinery (ACM)

Subject

General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development

Reference82 articles.

1. The input/output complexity of sorting and related problems

2. Amazon. [n.d.]. Amazon Web Services. https://aws.amazon.com/. Last accessed 11/1/2022. Amazon. [n.d.]. Amazon Web Services. https://aws.amazon.com/. Last accessed 11/1/2022.

3. New Algorithms for Disk Scheduling

4. Lars Arge . 2005 . The buffer tree: A new technique for optimal I/O-algorithms . In Algorithms and Data Structures: 4th International Workshop, WADS'95 Kingston, Canada, August 16--18, 1995 Proceedings. Springer, 334--345 . Lars Arge. 2005. The buffer tree: A new technique for optimal I/O-algorithms. In Algorithms and Data Structures: 4th International Workshop, WADS'95 Kingston, Canada, August 16--18, 1995 Proceedings. Springer, 334--345.

5. LinkBench

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3