Adaptive Cache Compression for High-Performance Processors-Reference-Cited by-同舟云学术

Adaptive Cache Compression for High-Performance Processors

Published:2004-03-02 Issue:2 Volume:32 Page:212
ISSN:0163-5964
Container-title:ACM SIGARCH Computer Architecture News
language:en
Short-container-title:SIGARCH Comput. Archit. News

Author:

Alameldeen Alaa R.¹,Wood David A.¹

Affiliation:

1. University of Wisconsin-Madison

Abstract

Modern processors use two or more levels ofcache memories to bridge the rising disparity betweenprocessor and memory speeds. Compression canimprove cache performance by increasing effectivecache capacity and eliminating misses. However,decompressing cache lines also increases cache accesslatency, potentially degrading performance.In this paper, we develop an adaptive policy thatdynamically adapts to the costs and benefits of cachecompression. We propose a two-level cache hierarchywhere the L1 cache holds uncompressed data and the L2cache dynamically selects between compressed anduncompressed storage. The L2 cache is 8-way set-associativewith LRU replacement, where each set can storeup to eight compressed lines but has space for only fouruncompressed lines. On each L2 reference, the LRUstack depth and compressed size determine whethercompression (could have) eliminated a miss or incurs anunnecessary decompression overhead. Based on thisoutcome, the adaptive policy updates a single globalsaturating counter, which predicts whether to allocatelines in compressed or uncompressed form.We evaluate adaptive cache compression usingfull-system simulation and a range of benchmarks. Weshow that compression can improve performance formemory-intensive commercial workloads by up to 17%.However, always using compression hurts performancefor low-miss-rate benchmarks-due to unnecessarydecompression overhead-degrading performance byup to 18%. By dynamically monitoring workload behavior,the adaptive policy achieves comparable benefitsfrom compression, while never degrading performanceby more than 0.4%.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/1028176.1006719

Reference42 articles.

1. Effective algorithms for cache-level compression

2. Simulating a $2M commercial server on a $2K PC

3. Generating representative Web workloads for network and server performance evaluation

Cited by 21 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enhancing RISC-V Processor Performance in Harsh Environments through Data Cache Optimization;2024 Panhellenic Conference on Electronics & Telecommunications (PACET);2024-03-28

2. DaeMon: Architectural Support for Efficient Data Movement in Fully Disaggregated Systems;Proceedings of the ACM on Measurement and Analysis of Computing Systems;2023-02-27

3. A Compression Router for Low-Latency Network-on-Chip;IEICE Transactions on Information and Systems;2023-02-01

4. High Performance Instruction Fetch Structure within a RISC-V Processor for Use in Harsh Environments;Lecture Notes in Computer Science;2023

5. Gray counters for non-volatile memories;Memories - Materials, Devices, Circuits and Systems;2022-10