Affiliation:
1. College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China
Abstract
Lossless data compression is a crucial and computing-intensive application in data-centric scenarios. To reduce CPU overhead, FPGA-based accelerators have been proposed to offload compression workloads. However, most existing schemes suffer from imbalanced resource utilization and poor practicability. In this paper, we propose HybriDC, an adaptive, resource-efficient CPU-FPGA heterogeneous acceleration system for lossless data compression. Leveraging the complementary advantages of the heterogeneous architecture, HybriDC provides a universal end-to-end compression acceleration framework with application compatibility and performance scalability. To optimize the hardware compression kernel design, we build a performance–resource model of the compression algorithm that accounts for the design goal, compression performance, and available resources, among other factors. Following the resource-balanced design principle deduced from this model, we fine-tune the compression algorithm parameters, which reduces the block RAM usage of the LZ4 kernel by 32%. For the parallel compression kernel implementation, we propose a memory-efficient parallel hash table with an extra checksum, which supports parallel processing and improves the compression ratio without extra memory. We develop an LZ4-based HybriDC system prototype and evaluate it in detail. Our LZ4 compression kernel achieves state-of-the-art memory efficiency, 2.5–4× better than existing designs with comparable compression ratios. The evaluation of total resource utilization and end-to-end throughput demonstrates the scalability of HybriDC. In terms of power efficiency, the four-kernel HybriDC prototype achieves a threefold advantage over the standard LZ4 algorithm.
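The abstract does not describe the internals of the hash table with an extra checksum, so the following is only a minimal software sketch of one plausible reading: each entry of an LZ4-style match-finder table stores a position plus a few checksum bits taken from otherwise unused bits of the same hash, so that mismatching candidates can be rejected before the history buffer is read. The bit widths (`HASH_BITS`, `CHECK_BITS`, `POS_BITS`), the multiplicative hash constant, and all function and class names are assumptions made for illustration and are not taken from the paper. The sketch is sequential and does not model the parallel access described in the abstract.

```python
from typing import List, Optional, Tuple

# All widths below are hypothetical values chosen for illustration.
HASH_BITS = 12    # 4096-entry table
CHECK_BITS = 4    # spare bits per entry used as a checksum
POS_BITS = 16     # position within a 64 KiB window
POS_MASK = (1 << POS_BITS) - 1


def hash_and_check(word: int) -> Tuple[int, int]:
    """Derive a table index and a checksum from one hash of a 4-byte word."""
    h = (word * 2654435761) & 0xFFFFFFFF          # multiplicative hash
    index = h >> (32 - HASH_BITS)                  # top bits select the entry
    check = (h >> (32 - HASH_BITS - CHECK_BITS)) & ((1 << CHECK_BITS) - 1)
    return index, check


class ChecksumHashTable:
    """Match-finder table whose entries pack (checksum, position)."""

    def __init__(self) -> None:
        # None marks an empty entry; otherwise (check << POS_BITS) | pos.
        self.entries: List[Optional[int]] = [None] * (1 << HASH_BITS)

    def lookup_and_update(self, data: bytes, pos: int) -> Optional[int]:
        """Return a candidate match position for data[pos:pos+4], then
        overwrite the entry with the current position."""
        word = int.from_bytes(data[pos:pos + 4], "little")
        index, check = hash_and_check(word)
        old = self.entries[index]
        self.entries[index] = (check << POS_BITS) | (pos & POS_MASK)
        if old is None:
            return None
        if (old >> POS_BITS) != check:
            return None        # rejected by checksum, no history read needed
        return old & POS_MASK


def find_match(table: ChecksumHashTable, data: bytes, pos: int) -> Optional[int]:
    """Look up a candidate and confirm it against the actual bytes."""
    cand = table.lookup_and_update(data, pos)
    if cand is not None and data[cand:cand + 4] == data[pos:pos + 4]:
        return cand
    return None


if __name__ == "__main__":
    data = b"abcdXYZabcd"
    table = ChecksumHashTable()
    print(find_match(table, data, 0))   # None: the table is empty
    print(find_match(table, data, 7))   # 0: "abcd" was seen at position 0
```

In this sketch the checksum occupies bits that the position field leaves unused in an entry, which is one way such a filter could come at no additional memory cost; the byte comparison in `find_match` is still needed because a checksum match does not guarantee equal data.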
Funder
National Natural Science Foundation of China
Subject
Electrical and Electronic Engineering, Mechanical Engineering, Control and Systems Engineering