Affiliation:
1. The University of Texas, Austin
Abstract
Growing wire delays will force substantive changes in the designs of large caches. Traditional cache architectures assume that each level in the cache hierarchy has a single, uniform access time. Increases in on-chip communication delays will make the hit time of large on-chip caches a function of a line's physical location within the cache. Consequently, cache access times will become a continuum of latencies rather than a single discrete latency. This non-uniformity can be exploited to provide faster access to cache lines in the portions of the cache that reside closer to the processor. In this paper, we evaluate a series of cache designs that provides fast hits to multi-megabyte cache memories. We first propose physical designs for these Non-Uniform Cache Architectures (NUCAs). We extend these physical designs with logical policies that allow important data to migrate toward the processor within the same level of the cache. We show that, for multi-megabyte level-two caches, an adaptive, dynamic NUCA design achieves 1.5 times the IPC of a Uniform Cache Architecture of any size, outperforms the best static NUCA scheme by 11%, outperforms the best three-level hierarchy--while using less silicon area--by 13%, and comes within 13% of an ideal minimal hit latency solution.
Publisher
Association for Computing Machinery (ACM)
Cited by
14 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Characterizing a Memory Allocator at Warehouse Scale;Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3;2024-04-27
2. edAttack: Hardware Trojan Attack on On-Chip Packet Compression;IEEE Design & Test;2023-12
3. Affinity Alloc: Taming Not-So Near-Data Computing;56th Annual IEEE/ACM International Symposium on Microarchitecture;2023-10-28
4. ZPP: A Dynamic Technique to Eliminate Cache Pollution in NoC based MPSoCs;ACM Transactions on Embedded Computing Systems;2023-09-09
5. DeepNR: An adaptive deep reinforcement learning based NoC routing algorithm;Microprocessors and Microsystems;2022-04