Early Address Prediction-Reference-Cited by-同舟云学术

Early Address Prediction

Published:2021-06 Issue:3 Volume:18 Page:1-22
ISSN:1544-3566
Container-title:ACM Transactions on Architecture and Code Optimization
language:en
Short-container-title:ACM Trans. Archit. Code Optim.

Author:

Alves Ricardo¹,Kaxiras Stefanos¹,Black-Schaffer David¹

Affiliation:

1. Uppsala University, Uppsala, Sweden

Abstract

Achieving low load-to-use latency with low energy and storage overheads is critical for performance. Existing techniques either prefetch into the pipeline (via address prediction and validation) or provide data reuse in the pipeline (via register sharing or L0 caches). These techniques provide a range of tradeoffs between latency, reuse, and overhead. In this work, we present a pipeline prefetching technique that achieves state-of-the-art performance and data reuse without additional data storage, data movement, or validation overheads by adding address tags to the register file. Our addition of register file tags allows us to forward (reuse) load data from the register file with no additional data movement, keep the data alive in the register file beyond the instruction’s lifetime to increase temporal reuse, and coalesce prefetch requests to achieve spatial reuse. Further, we show that we can use the existing memory order violation detection hardware to validate prefetches and data forwards without additional overhead. Our design achieves the performance of existing pipeline prefetching while also forwarding 32% of the loads from the register file (compared to 15% in state-of-the-art register sharing), delivering a 16% reduction in L1 dynamic energy (1.6% total processor energy), with an area overhead of less than 0.5%.

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Information Systems,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3458883

Reference41 articles.

1. Dynamically Disabling Way-prediction to Reduce Instruction Replay

2. Addressing Energy Challenges in Filter Caches

3. Filter caching for free

4. Flexible register management using reference counting

5. Correlated load-address predictors

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A prefetching indexing scheme for in-memory database systems;Future Generation Computer Systems;2024-07

2. Doppelganger Loads: A Safe, Complexity-Effective Optimization for Secure Speculation Schemes;Proceedings of the 50th Annual International Symposium on Computer Architecture;2023-06-17

3. Register file prefetching;Proceedings of the 49th Annual International Symposium on Computer Architecture;2022-06-11