WALTZ: Leveraging Zone Append to Tighten the Tail Latency of LSM Tree on ZNS SSD

Author:

Lee Jongsung¹, Kim Donguk¹, Lee Jae W.²

Affiliation:

1. Seoul National University and Samsung Electronics, Korea

2. Seoul National University, Korea

Abstract

We propose WALTZ, an LSM tree-based key-value store on the emerging Zoned Namespace (ZNS) SSD. The key contribution of WALTZ is to leverage the zone append command, a recent addition to ZNS SSD specifications, to provide tight tail latency. The long tail latency caused by merging multiple parallel writes, called batch-group writes, is effectively addressed by the internal synchronization mechanism of the ZNS SSD. To provide fast failover when the active zone of a write-ahead log (WAL) file becomes full during parallel appends, WALTZ introduces a mechanism for WAL zone replacement and reservation. Finally, lazy metadata management lets a put query be processed quickly without any additional synchronization, enabling lock-free execution of individual append commands. For evaluation, we use both microbenchmarks (db_bench) with varying read/write ratios and key skewness, and realistic social-graph workloads (MixGraph from Facebook). Our evaluation demonstrates a geomean tail-latency reduction of 2.19× for db_bench and 2.45× for MixGraph, with maximum reductions of 3.02× and 4.73×, respectively. As a side effect of eliminating the overhead of batch-group writes, WALTZ also improves query throughput (QPS) by up to 11.7%.

Publisher

Association for Computing Machinery (ACM)

Subject

General Earth and Planetary Sciences; Water Science and Technology; Geography, Planning and Development


Cited by 3 articles.

1. ZWAL: Rethinking Write-ahead Logs for ZNS SSDs with Zone Appends;ACM SIGOPS Operating Systems Review;2024-08-14

2. Bf-Tree: A Modern Read-Write-Optimized Concurrent Larger-Than-Memory Range Index;Proceedings of the VLDB Endowment;2024-07

3. ZWAL: Rethinking Write-ahead Logs for ZNS SSDs with Zone Appends;Proceedings of the 4th Workshop on Challenges and Opportunities of Efficient and Performant Storage Systems;2024-04-22
