x Meta : SSD-HDD-hybrid Optimization for Metadata Maintenance of Cloud-scale Object Storage

Author:

Chen Yan1ORCID,Ke Qiwen2ORCID,Li Huiba3ORCID,Wu Yongwei4ORCID,Zhang Yiming5ORCID

Affiliation:

1. Tsinghua University, Beijing, China

2. NICEX Lab, Xiamen, China

3. Alibaba Group, Hangzhou, China

4. Tsinghua University, Beijing, China and Quan Cheng Laboratory, Jinan, China

5. NICE xLab, XMU, Xiamen, China

Abstract

Object storage has been widely used in the cloud. Traditionally, the size of object metadata is much smaller than that of object data, and thus existing object storage systems (such as Ceph and Oasis) can place object data and metadata, respectively, on hard disk drives (HDDs) and solid-state drives (SSDs) to achieve high I/O performance at a low monetary cost. Currently, however, a wide range of cloud applications organize their data as large numbers of small objects of which the data size is close to (or even smaller than) the metadata size, thus greatly increasing the cost if placing all metadata on expensive SSDs. This article presents x Meta , an SSD-HDD-hybrid optimization for metadata maintenance of cloud-scale object storage. We observed that a substantial portion of the metadata of small objects is rarely accessed and thus can be stored on HDDs with little performance penalty. Therefore, x Meta first classifies the hot and cold metadata based on the frequency of metadata accesses of upper-layer applications and then adaptively stores the hot metadata on SSDs and the cold metadata on HDDs. We also propose a merging mechanism for hot metadata to further improve the efficiency of SSD storage and optimize range key query and insertion for hot metadata by designing composite keys. We have integrated the x Meta metadata service with Ceph to realize a high-performance, low-cost object store (called xCeph). The extensive evaluation shows that xCeph outperforms the original Ceph by an order of magnitude in the space requirement of SSD storage, while improving the throughput by up to 2.7×.

Funder

National Key Research and Development Program of China

Alibaba Research Fellow Project

Publisher

Association for Computing Machinery (ACM)

Reference28 articles.

1. Ceph Team. Ceph Object Gateway. https://docs.ceph.com/en/pacific/radosgw

2. Delta lake

3. Qing Zheng Haopeng Chen Yaguang Wang Jiangang Duan and Zhiteng Huang. 2012. Cosbench: A benchmark tool for cloud object storage services. In 2012 IEEE Fifth International Conference on Cloud Computing. 998–999.

4. The Hadoop distributed file system: Architecture and design;Borthakur Dhruba;Hadoop Proj. Website,2007

5. Big metadata

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3