Leveraging Glocality for Fast Failure Recovery in Distributed RAM Storage

Author:

Zhang Yiming1ORCID,Li Dongsheng1,Liu Ling2

Affiliation:

1. National University of Defense Technology, Changsha, China

2. Georgia Institute of Technology, Atlanta, USA

Abstract

Distributed RAM storage aggregates the RAM of servers in data center networks (DCN) to provide extremely high I/O performance for large-scale cloud systems. For quick recovery of storage server failures, MemCube [53] exploits the proximity of the BCube network to limit the recovery traffic to the recovery servers’ 1-hop neighborhood. However, the previous design is applicable only to the symmetric BCube( n , k ) network with n k +1 nodes and has suboptimal recovery performance due to congestion and contention. To address these problems, in this article, we propose CubeX, which (i) generalizes the “1-hop” principle of MemCube for arbitrary cube-based networks and (ii) improves the throughput and recovery performance of RAM-based key-value (KV) store via cross-layer optimizations. At the core of CubeX is to leverage the glocality (= globality + locality) of cube-based networks: It scatters backup data across a large number of disks globally distributed throughout the cube and restricts all recovery traffic within the small local range of each server node. Our evaluation shows that CubeX not only efficiently supports RAM-based KV store for cube-based networks but also significantly outperforms MemCube and RAMCloud in both throughput and recovery time.

Funder

National Natural Science Foundation of China

NSF SaTC

IBM faculty award

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture

Reference54 articles.

1. AWS Team. Summary of the Amazon EC2 and Amazon RDS Service Disruption in the US East Region. Retrieved from http://aws.amazon.com/message/65648/. AWS Team. Summary of the Amazon EC2 and Amazon RDS Service Disruption in the US East Region. Retrieved from http://aws.amazon.com/message/65648/.

2. NiceX Lab. Ursa Block Store. Retrieved from http://nicexlab.com/ursa/. NiceX Lab. Ursa Block Store. Retrieved from http://nicexlab.com/ursa/.

3. RedisLabs. Redis Official Website. Retrieved from http://redis.io/. RedisLabs. Redis Official Website. Retrieved from http://redis.io/.

4. Dhruba Borthakur. HDFS Architecture Guide. Retrieved from https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html. Dhruba Borthakur. HDFS Architecture Guide. Retrieved from https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html.

Cited by 15 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Hybrid Block Storage for Efficient Cloud Volume Service;ACM Transactions on Storage;2023-10-03

2. Fast CU patition based on image similarity using neural network;Multimedia Tools and Applications;2023-09-26

3. Accelerating QTMT-based CU partition and intra mode decision for versatile video coding;Journal of Visual Communication and Image Representation;2023-06

4. Oasis : Controlling Data Migration in Expansion of Object-based Storage Systems;ACM Transactions on Storage;2023-01-19

5. The ZZ domain of HERC2 is a receptor of arginylated substrates;Scientific Reports;2022-04-11

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3