Affiliation:
1. Chalmers University of Technology, Gothenburg, Sweden, Sweden
Abstract
This article describes Memory Squeeze (MemSZ), a new approach for lossy general-purpose memory compression. MemSZ introduces a low latency, parallel design of the Squeeze (SZ) algorithm offering aggressive compression ratios, up to 16:1 in our implementation. Our compressor is placed between the memory controller and the cache hierarchy of a processor to reduce the memory traffic of applications that tolerate approximations in parts of their data. Thereby, the available off-chip bandwidth is utilized more efficiently improving system performance and energy efficiency. Two alternative multi-core variants of the MemSZ system are described. The first variant has a shared last-level cache (LLC) on the processor-die, which is modified to store both compressed and uncompressed data. The second has a 3D-stacked DRAM cache with larger cache lines that match the granularity of the compressed memory blocks and stores only uncompressed data. For applications that tolerate aggressive approximation in large fractions of their data, MemSZ reduces baseline memory traffic by up to 81%, execution time by up to 62%, and energy costs by up to 25% introducing up to 1.8% error to the application output. Compared to the current state-of-the-art lossy memory compression design, MemSZ improves the execution time, energy, and memory traffic by up to 15%, 9%, and 64%, respectively.
Funder
Swedish Research Council under the ACE project
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Information Systems,Software
Reference63 articles.
1. NVIDIA. 2015. NVIDIA Tegra X1: NVIDIA’s New Mobile Superchip. whitepaper. NVIDIA. 2015. NVIDIA Tegra X1: NVIDIA’s New Mobile Superchip. whitepaper.
2. 2018. 1D K-Means Open Source. Retrieved October 13 2018 from https://github.com/eldstal/kmeans. 2018. 1D K-Means Open Source. Retrieved October 13 2018 from https://github.com/eldstal/kmeans.
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Multifacets of lossy compression for scientific data in the Joint-Laboratory of Extreme Scale Computing;Future Generation Computer Systems;2025-02
2. A novel approximate cache block compressor for error-resilient image data;Computers and Electrical Engineering;2024-04
3. Smart-DNN+: A Memory-efficient Neural Networks Compression Framework for the Model Inference;ACM Transactions on Architecture and Code Optimization;2023-10-26
4. Stream Aggregation with Compressed Sliding Windows;ACM Transactions on Reconfigurable Technology and Systems;2023-06-20
5. FlatPack;Proceedings of the International Conference on Parallel Architectures and Compilation Techniques;2022-10-08