Memory bandwidth limitations of future microprocessors

Published: 1996-05 Issue: 2 Volume: 24 Page: 78-89
ISSN: 0163-5964
Container-title: ACM SIGARCH Computer Architecture News
Language: en
Short-container-title: SIGARCH Comput. Archit. News

Author:

Burger Doug¹, Goodman James R.¹, Kägi Alain¹

Affiliation:

1. Computer Sciences Department, University of Wisconsin-Madison, 1210 West Dayton Street, Madison, Wisconsin

Abstract

This paper makes the case that pin bandwidth will be a critical consideration for future microprocessors. We show that many of the techniques used to tolerate growing memory latencies do so at the expense of increased bandwidth requirements. Using a decomposition of execution time, we show that for modern processors that employ aggressive memory latency tolerance techniques, wasted cycles due to insufficient bandwidth generally exceed those due to raw memory latencies. Given the importance of maximizing memory bandwidth, we calculate effective pin bandwidth, then estimate optimal effective pin bandwidth. We measure these quantities by determining the amount by which both caches and minimal-traffic caches filter accesses to the lower levels of the memory hierarchy. We see that there is a gap that can exceed two orders of magnitude between the total memory traffic generated by caches and the minimal-traffic caches, implying that the potential exists to increase effective pin bandwidth substantially. We decompose this traffic gap into four factors, and show they contribute quite differently to traffic reduction for different benchmarks. We conclude that, in the short term, pin bandwidth limitations will make more complex on-chip caches cost-effective. For example, flexible caches may allow individual applications to choose from a range of caching policies. In the long term, we predict that off-chip accesses will be so expensive that all system memory will reside on one or more processor chips.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/232974.232983

References: 43 articles.

1. A Study of Replacement Algorithms for a Virtual-Storage Computer;Belady L.A.;IBM Systems Journal

2. Software prefetching

Cited by 84 articles.

1. Perspective: Entropy-stabilized oxide memristors;Applied Physics Letters;2024-08-12

2. A Fully Digital Relaxation-Aware Analog Programming Technique for HfOx RRAM Arrays;IEEE Transactions on Circuits and Systems II: Express Briefs;2024-08

3. Large-scale photonic inverse design: computational challenges and breakthroughs;Nanophotonics;2024-06-07

4. Puppeteer: A Random Forest Based Manager for Hardware Prefetchers Across the Memory Hierarchy;ACM Transactions on Architecture and Code Optimization;2022-12-16

5. Technical Difficulties and Development Trend;Software Defined Chips;2022-11-15