Affiliation:
1. Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Hubei, China
2. Department of Electrical and Computer Engineering, Virginia Commonwealth University
Abstract
To satisfy the explosive growth of data in large-scale data centers, where redundant arrays of independent disks (RAIDs), especially RAID-5, are widely deployed, effective storage scaling and disk expansion methods are desired. However, a way to reduce the data migration overhead and maintain the reliability of the original RAID are major concerns of storage scaling. To address these problems, we propose a new RAID scaling scheme, H-Scale, to achieve fast RAID scaling via hybrid stripe layouts. H-Scale takes advantage of the loose restriction of stripe structures to choose migrated data and to create hybrid stripe structures. The main advantages of our scheme include: (1) dramatically reducing the data migration overhead and thus speeding up the scaling process, (2) maintaining the original RAID’s reliability, (3) balancing the workload among disks after scaling, and (4) providing a general scaling approach for different RAID levels. Our theoretical analysis show that H-Scale outperforms existing scaling solutions in terms of data migration, I/O overheads, and parity update operations. Evaluation results on a prototype implementation demonstrate that H-Scale speeds up the online scaling process by up to 60% under SPC traces, and similar improvements on scaling time and user response time are also achieved by evaluations using standard benchmarks.
Funder
National Natural Science Foundation of China
Director Fund of WNLO
Key Laboratory of Data Storage System
Ministry of Education
U.S. National Science Foundation
Fundamental Research Funds for the Central Universities HUST
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture
Reference43 articles.
1. Random RAIDs with selective exploitation of redundancy for high performance video servers
2. Dynamic and Redundant Data Placement
3. Efficient, distributed data placement strategies for storage area networks (extended abstract)
4. Neil Brown. 2007. Online RAID-5 resizing. drivers/md/ raid5.c in the source code of Linux Kernel 2.6.23. (2007). http://neil.brown.name/blog/mdadm. Neil Brown. 2007. Online RAID-5 resizing. drivers/md/ raid5.c in the source code of Linux Kernel 2.6.23. (2007). http://neil.brown.name/blog/mdadm.
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Achieving Tunable Erasure Coding with Cluster-Aware Redundancy Transitioning;ACM Transactions on Architecture and Code Optimization;2024-09-14
2. Storage Reliability;Data Storage Architectures and Technologies;2024
3. Accelerate RAID scaling by reducing disk I/Os and XOR operations;Proceedings of the 3rd International Conference on High Performance Compilation, Computing and Communications;2019-03-08
4. A Fast Approach to Scale Up Disk Arrays With Parity Declustered Data Layout by Minimizing Data Migration;IEEE Access;2019