The Case for Custom Storage Backends in Distributed Storage Systems

Author:

Aghayev Abutalib1,Weil Sage2,Kuchnik Michael1,Nelson Mark2,Ganger Gregory R.1,Amvrosiadis George1

Affiliation:

1. Carnegie Mellon University, Pittsbrugh, PA, USA

2. Red Hat, Inc., Raleigh, NC, USA

Abstract

For a decade, the Ceph distributed file system followed the conventional wisdom of building its storage backend on top of local file systems. This is a preferred choice for most distributed file systems today, because it allows them to benefit from the convenience and maturity of battle-tested code. Ceph’s experience, however, shows that this comes at a high price. First, developing a zero-overhead transaction mechanism is challenging. Second, metadata performance at the local level can significantly affect performance at the distributed level. Third, supporting emerging storage hardware is painstakingly slow. Ceph addressed these issues with BlueStore, a new backend designed to run directly on raw storage devices. In only two years since its inception, BlueStore outperformed previous established backends and is adopted by 70% of users in production. By running in user space and fully controlling the I/O stack, it has enabled space-efficient metadata and data checksums, fast overwrites of erasure-coded data, inline compression, decreased performance variability, and avoided a series of performance pitfalls of local file systems. Finally, it makes the adoption of backward-incompatible storage hardware possible, an important trait in a changing storage landscape that is learning to embrace hardware diversity.

Funder

NDSEG Fellowship

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture

Reference118 articles.

1. Amazon.com Inc. 2019. Amazon Elastic Block Store. Retrieved from https://aws.amazon.com/ebs/. Amazon.com Inc. 2019. Amazon Elastic Block Store. Retrieved from https://aws.amazon.com/ebs/.

2. Amazon.com Inc. 2019. Amazon S3. Retrieved from https://aws.amazon.com/s3/. Amazon.com Inc. 2019. Amazon S3. Retrieved from https://aws.amazon.com/s3/.

3. Jens Axboe. 2009. Queue sysfs Files. Retrieved from https://www.kernel.org/doc/Documentation/block/queue-sysfs.txt. Jens Axboe. 2009. Queue sysfs Files. Retrieved from https://www.kernel.org/doc/Documentation/block/queue-sysfs.txt.

Cited by 9 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Efficient Security Interface for High-Performance Ceph Storage System;2024

2. KV-CSD: A Hardware-Accelerated Key-Value Store for Data-Intensive Applications;2023 IEEE International Conference on Cluster Computing (CLUSTER);2023-10-31

3. The Open-Source DeLiBA2 Hardware/Software Framework for Distributed Storage Accelerators;ACM Transactions on Reconfigurable Technology and Systems;2023-09-14

4. Research on distributed competition big data hierarchical storage method based on Ceph architecture;International Conference on Signal Processing and Communication Security (ICSPCS 2022);2022-11-02

5. DeLiBA: An Open-Source Hardware/Software Framework for the Development of Linux Block I/O Accelerators;2022 32nd International Conference on Field-Programmable Logic and Applications (FPL);2022-08

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3