Enhancing Power Efficiency in Branch Target Buffer Design with a Two-Level Prediction Mechanism

Author:

Nian Jiawei12,Liu Hongjin23,Gao Xin23,Zhang Shaolin2,Yang Mengfei4

Affiliation:

1. School of Computer Science, and Technology, Xidian University, Xi’an 710071, China

2. Beijing SunWise Space Technology Ltd., Beijing 100190, China

3. Beijing Institute of Control Engineering, Beijing 100190, China

4. China Academy of Space Technology, Beijing 100098, China

Abstract

Modern processors often face challenges when handling instructions that overwhelm the branch target buffer (BTB), leading to front-end bottlenecks. As the BTB’s capacity increases, its prediction module can become slower and power-hungry. In this paper, we introduce a straightforward yet highly effective two-level prediction mechanism to mitigate the escalating power consumption in the BTB structure, achieved by reducing the number of accesses. Our approach incorporates two main elements: M-BTB and V-BTB. M-BTB encompasses the first-level prediction mechanism and a fully associative BTB, while V-BTB houses the second-level prediction mechanism and a set-associative BTB. To implement our prediction mechanism, we optimize the traditional two-level BTB structure. The first level employs the skew mechanism, and the second level dissects the tag bits to create the Partial tag. These two levels of prediction mechanism correspond to the bank/way prediction for the two-level BTB structure. Our experimental results show that the first-stage prediction mechanism reduces M-BTB accesses by 75%, while the second-stage prediction mechanism ensures that over 98% of addresses require just zero or one way of the V-BTB to achieve a hit result. Our proposed approach achieves a remarkable 86–97% reduction in power consumption, with a minimal impact on performance and an increase in overall area efficiency.

Publisher

MDPI AG

Reference31 articles.

1. Vishal, G., and Panda, B. (2022, January 17–22). Micro btb: A high performance and storage efficient last-level branch target buffer for servers. Proceedings of the 19th ACM International Conference on Computing Frontiers, Turin, Italy.

2. Design of a G-Share branch predictor for EISC processor;Kim;IEIE Trans. Smart Process. Comput.,2015

3. Ranganathan, N., Nagarajan, R., Jimenez, D., Burger, D., Keckler, S.W., and Lin, C. (2002). Combining Hyperblocks and Exit Prediction to Increase Front-End Bandwidth and Performance, The University of Texas at Austin.

4. Seznec, A. (2011, January 3–7). A new case for the tage branch predictor. Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture, Porto Alegre, Brazil.

5. The optimum pipeline depth for a microprocessor;Hartstein;ACM Sigarch Comput. Archit. News,2002

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3