Index Shipping for Efficient Replication in LSM Key-Value Stores with Hybrid KV Placement

Author:

Stylianakis Giorgos (1), Saloustros Giorgos (2), Chiotakis Orestis (1), Xanthakis Giorgos (1), Bilas Angelos (1)

Affiliation:

1. FORTH, Heraklion, Greece and University of Crete, Heraklion, Greece

2. FORTH, Heraklion, Greece

Abstract

Key-value (KV) stores based on the LSM tree have become a foundational layer in the storage stack of datacenters and cloud services. Current approaches for achieving reliability and availability favor reducing network traffic and send only new KV pairs to replicas. As a result, they perform costly compactions to reorganize data in both the primary and backup nodes, which increases device I/O traffic and CPU overhead, and eventually hurts overall system performance. In this article, we describe Tebis, an efficient LSM-based KV store that reduces I/O amplification and CPU overhead for maintaining the replica index. We use a primary-backup replication scheme that performs compactions only on the primary nodes and sends pre-built indexes to backup nodes, avoiding all compactions in backup nodes. Our approach includes an efficient mechanism to deal with pointer translation across nodes in the pre-built region index. Our results show that Tebis reduces resource utilization on backup nodes compared to performing full compactions: throughput is increased by 1.06 to 2.90×, CPU efficiency is increased by 1.21 to 2.78×, and I/O amplification is reduced by 1.7 to 3.27×, whereas network traffic increases by 1.32 to 3.76×.
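The pointer translation mentioned above can be illustrated with a minimal sketch. It is not the Tebis implementation: it assumes, purely for illustration, that index nodes store absolute device offsets, that storage is allocated in fixed-size segments, and that the backup keeps a map from each primary segment to the local segment holding the same data. The names `SEGMENT_SIZE`, `IndexNode`, `translate_pointer`, and `rewrite_index` are hypothetical.

```python
from dataclasses import dataclass

# Hypothetical segment size in bytes (an assumption, not taken from the article).
SEGMENT_SIZE = 2 * 1024 * 1024


@dataclass
class IndexNode:
    """A simplified index node: keys plus the device offsets they point to."""
    keys: list
    pointers: list  # absolute device offsets, valid only on the node that built the index


def translate_pointer(offset, segment_map):
    """Rewrite a primary device offset into the matching backup device offset."""
    within_segment = offset % SEGMENT_SIZE        # position inside the segment
    primary_segment = offset - within_segment     # start offset of the primary segment
    backup_segment = segment_map[primary_segment]  # KeyError if the segment is unknown
    return backup_segment + within_segment


def rewrite_index(nodes, segment_map):
    """Return copies of the shipped index nodes with every pointer translated."""
    return [
        IndexNode(
            keys=list(node.keys),
            pointers=[translate_pointer(p, segment_map) for p in node.pointers],
        )
        for node in nodes
    ]


# Primary segment start offset -> backup segment start offset (illustrative values).
segment_map = {0: 4 * SEGMENT_SIZE, SEGMENT_SIZE: 9 * SEGMENT_SIZE}
shipped = [IndexNode(keys=["a", "m"], pointers=[128, SEGMENT_SIZE + 4096])]
local = rewrite_index(shipped, segment_map)
```

Under these assumptions the backup only rewrites offsets in the shipped nodes and never merges or re-sorts keys, which matches the abstract's point that backups avoid compactions at the cost of extra network traffic.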

Funder

European High-Performance Computing Joint Undertaking

The EuroHPC JU

European Union’s Horizon 2020 research and innovation programme and France, Germany, Italy, Greece, United Kingdom, Czech Republic, and Croatia

European Commission under the European High-Performance Computing Joint Undertaking

Publisher

Association for Computing Machinery (ACM)

