Perseid: A Secondary Indexing Mechanism for LSM-based Storage Systems

Author:

Wang Jing1,Lu Youyou1,Wang Qing1,Zhang Yuhao1,Shu Jiwu1

Affiliation:

1. Tsinghua University, China

Abstract

LSM-based storage systems are widely used for superior write performance on block devices. However, they currently fail to efficiently support secondary indexing, since a secondary index query operation usually needs to retrieve multiple small values, which scatter in multiple LSM components. In this work, we revisit secondary indexing in LSM-based storage systems with byte-addressable persistent memory (PM). Existing PM-based indexes are not directly competent for efficient secondary indexing. We propose Perseid , an efficient PM-based secondary indexing mechanism for LSM-based storage systems, which takes into account both characteristics of PM and secondary indexing. Perseid consists of (1) a specifically designed secondary index structure that achieves high-performance insertion and query, (2) a lightweight hybrid PM-DRAM and hash-based validation approach to filter out obsolete values with subtle overhead, and (3) two adapted optimizations on primary table searching issued from secondary indexes to accelerate non-index-only queries. Our evaluation shows that Perseid outperforms existing PM-based indexes by 3-7 × and achieves about two orders of magnitude performance of state-of-the-art LSM-based secondary indexing techniques even if on PM instead of disks.

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture

Reference73 articles.

1. 2022. Apache Cassandra. https://cassandra.apache.org/. 2022. Apache Cassandra. https://cassandra.apache.org/.

2. 2022. Apache Cassandra: How are indexes stored and updated. https://docs.datastax.com/en/cassandra-oss/3.x/cassandra/dml/dmlIndexInternals.html. 2022. Apache Cassandra: How are indexes stored and updated. https://docs.datastax.com/en/cassandra-oss/3.x/cassandra/dml/dmlIndexInternals.html.

3. 2022. Chirp: A Twitter-like workload generator. http://alumni.cs.ucr.edu/~ameno002/benchmark/. 2022. Chirp: A Twitter-like workload generator. http://alumni.cs.ucr.edu/~ameno002/benchmark/.

4. 2022. Compute Express Link: The Breakthrough CPU-to-Device Interconnect. https://www.computeexpresslink.org/. 2022. Compute Express Link: The Breakthrough CPU-to-Device Interconnect. https://www.computeexpresslink.org/.

5. 2022. MongoDB. https://www.mongodb.com. 2022. MongoDB. https://www.mongodb.com.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3