Hybrid cache architecture with disparate memory technologies

Published:2009-06-15 Issue:3 Volume:37 Page:34-45
ISSN:0163-5964
Container-title:ACM SIGARCH Computer Architecture News
language:en
Short-container-title:SIGARCH Comput. Archit. News

Author:

Wu Xiaoxia¹, Li Jian², Zhang Lixin², Speight Evan², Rajamony Ram², Xie Yuan¹

Affiliation:

1. Pennsylvania State University, University Park, PA, USA

2. IBM Austin Research Lab, Austin, TX, USA

Abstract

Caching techniques have been an efficient mechanism for mitigating the effects of the processor-memory speed gap. Traditional multi-level SRAM-based cache hierarchies, especially in the context of chip multiprocessors (CMPs), present many challenges in area requirements, core-to-cache balance, power consumption, and design complexity. New advancements in technology enable caches to be built from other technologies, such as Embedded DRAM (EDRAM), Magnetic RAM (MRAM), and Phase-change RAM (PRAM), in both 2D chips and 3D stacked chips. Caches fabricated in these technologies offer dramatically different power and performance characteristics when compared with SRAM-based caches, particularly in the areas of access latency, cell density, and overall power consumption. In this paper, we propose to take advantage of the best characteristics that each technology offers, through the use of Hybrid Cache Architecture (HCA) designs. We discuss and evaluate two types of hybrid cache architectures: inter cache Level HCA (LHCA), in which the levels in a cache hierarchy can be made of disparate memory technologies; and intra cache level or cache Region based HCA (RHCA), where a single level of cache can be partitioned into multiple regions, each of a different memory technology. We have studied a number of different HCA architectures and explored the potential of hardware support for intra-cache data movement and power consumption management within HCA caches. Utilizing a full-system simulator that has been validated against real hardware, we demonstrate that an LHCA design can provide a geometric mean 7% IPC improvement over a baseline 3-level SRAM cache design under the same area constraint across a collection of 25 workloads. A more aggressive RHCA-based design provides 12% IPC improvement over the baseline. Finally, a 2-layer 3D cache stack (3DHCA) of high-density memory technology within the same chip footprint gives 18% IPC improvement over the baseline. Furthermore, up to 70% reduction in power consumption over a baseline SRAM-only design is achieved.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/1555815.1555761

References: 30 articles.

1. BioPerf: A benchmark suite to evaluate high-performance computer architecture on bioinformatics applications

2. Managing Wire Delay in Large Chip-Multiprocessor Caches

3. The PARSEC benchmark suite

4. Mambo

Cited by 159 articles.

1. Spin logic devices based on negative differential resistance-enhanced anomalous Hall effect;International Journal of Minerals, Metallurgy and Materials;2024-05-27

2. Security Advantages and Challenges of 3D Heterogeneous Integration;Computer;2024-03

3. Neuromorphic Computing between Reality and Future Needs;Neuromorphic Computing;2023-11-15

4. Hybrid, Asymmetric and Reconfigurable Input Unit Designs for Energy-Efficient On-Chip Networks;IEICE Transactions on Electronics;2023-10-01

5. Effective Stack Wear Leveling for NVM;IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems;2023-10