Mitigating Prefetcher-Caused Pollution Using Informed Caching Policies for Prefetched Blocks-Reference-Cited by-同舟云学术

Mitigating Prefetcher-Caused Pollution Using Informed Caching Policies for Prefetched Blocks

Published:2015-01-09 Issue:4 Volume:11 Page:1-22
ISSN:1544-3566
Container-title:ACM Transactions on Architecture and Code Optimization
language:en
Short-container-title:ACM Trans. Archit. Code Optim.

Author:

Seshadri Vivek¹,Yedkar Samihan¹,Xin Hongyi¹,Mutlu Onur¹,Gibbons Phillip B.²,Kozuch Michael A.²,Mowry Todd C.¹

Affiliation:

1. Carnegie Mellon University, Pittsburgh PA

2. Intel Pittsburgh, Pittsburgh PA

Abstract

Many modern high-performance processors prefetch blocks into the on-chip cache. Prefetched blocks can potentially pollute the cache by evicting more useful blocks. In this work, we observe that both accurate and inaccurate prefetches lead to cache pollution, and propose a comprehensive mechanism to mitigate prefetcher-caused cache pollution. First, we observe that over 95% of useful prefetches in a wide variety of applications are not reused after the first demand hit (in secondary caches). Based on this observation, our first mechanism simply demotes a prefetched block to the lowest priority on a demand hit. Second, to address pollution caused by inaccurate prefetches, we propose a self-tuning prefetch accuracy predictor to predict if a prefetch is accurate or inaccurate. Only predicted-accurate prefetches are inserted into the cache with a high priority. Evaluations show that our final mechanism, which combines these two ideas, significantly improves performance compared to both the baseline LRU policy and two state-of-the-art approaches to mitigating prefetcher-caused cache pollution (up to 49%, and 6% on average for 157 two-core multiprogrammed workloads). The performance improvement is consistent across a wide variety of system configurations.

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Information Systems,Software

Link

https://dl.acm.org/doi/pdf/10.1145/2677956

Reference51 articles.

1. Alaa R. Alameldeen and David A. Wood. 2007. Interactions between compression and prefetching in chip multiprocessors. In HPCA. 10.1109/HPCA.2007.346200 Alaa R. Alameldeen and David A. Wood. 2007. Interactions between compression and prefetching in chip multiprocessors. In HPCA. 10.1109/HPCA.2007.346200

2. Jorge Albericio Pablo Ibáñez Víctor Viñals and José M. Llabería. 2013. The reuse cache: downsizing the shared last-level cache. In MICRO. 10.1145/2540708.2540735 Jorge Albericio Pablo Ibáñez Víctor Viñals and José M. Llabería. 2013. The reuse cache: downsizing the shared last-level cache. In MICRO. 10.1145/2540708.2540735

3. Susanne Albers and Markus Büttner. 2003. Integrated prefetching and caching in single and parallel disk systems. In SPAA. 10.1145/777412.777431 Susanne Albers and Markus Büttner. 2003. Integrated prefetching and caching in single and parallel disk systems. In SPAA. 10.1145/777412.777431

4. Jean-Loup Baer and Tien-Fu Chen. 1995. Effective hardware-based data prefetching for high-performance processors. IEEE TC (1995). 10.1109/12.381947 Jean-Loup Baer and Tien-Fu Chen. 1995. Effective hardware-based data prefetching for high-performance processors. IEEE TC (1995). 10.1109/12.381947

Cited by 35 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Two Level Neural Approach Combining Off-Chip Prediction with Adaptive Prefetch Filtering;2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2024-03-02

2. CLIP: Load Criticality based Data Prefetching for Bandwidth-constrained Many-core Systems;56th Annual IEEE/ACM International Symposium on Microarchitecture;2023-10-28

3. ZPP: A Dynamic Technique to Eliminate Cache Pollution in NoC based MPSoCs;ACM Transactions on Embedded Computing Systems;2023-09-09

4. A Prefetch-Adaptive Intelligent Cache Replacement Policy Based on Machine Learning;Journal of Computer Science and Technology;2023-03-30

5. Criticality-aware priority to accelerate GPU memory access;The Journal of Supercomputing;2022-07-06