NUCA-2A: A New Adaptive and Behavior Aware Block Placement Process

Author:

Souahi Mohamed Salah1,Mohammed Mohamed Ben1

Affiliation:

1. LIRE Laboratory, University of Abdelhamid MEHRI, Constantine, Algeria

Abstract

Background: The last three decades were marked by a spectacular evolution of CPUs. Both cores number on chip and shared Low Level Cache (LLC) size are increasing what makes LLC the bottleneck's system. One major weakness of future cache memory hierarchies will be to carry out memory blocks availability for vertical requests, with no consideration to horizontal proximity to cores. Simulations show that some LLC accesses cost more latency cycles than off-chip accesses. Objective: This paper presents a new adaptive and blocks behavior aware process, called NUCA-2A. It manages blocks in LLC in a purpose of reducing it's latency, and it's inner bandwidth, by studying each block's behavior, and by placing it in the most suitable location among LLC banks. Methods: LLC accesses are classified basing on each one's specific behavior. Authors establish also a two levels horizontal hierarchy in LLC. This work consists to place blocks in the zones that matches the best their behaviors. Results: In contrast to the classic S-NUCA scheme, NUCA-2A makes a reduction of up to 60,39% of global LLC latency as well as 40,74% of average inner traffic. It makes also an average speedup of 17,89 % in term of number of instructions executed by cycle. Conclusion: Behaviors study gives encouraging results. Several methods are in use in different fields to forecast a behavior basing on previous observations. We are working on a prefetching model that permits blocks migration to and from privileged banks.

Funder

CNEPRU Project

Publisher

Bentham Science Publishers Ltd.

Subject

General Computer Science

Reference27 articles.

1. Benczur A. The digital universe – an information theoretical analyses

2. Intel Corporation. Intel Xeon Phi Processor 7290. Available from: https://ark.intel.com/products/95831/Intel-Xeon-Phi-Processor-729 0F-16GB-1_50-GHz-72-core,2018.\newblock Accessed on 2018- 06-30.

3. Mellanox Technologies.TILE-Gx72 Processor, 2018. Available from: http://www.mellanox.com/page/products_dyn?product_ family= 238&mtag=tile_gx72 (Accessed: 30th Jun 2018).

4. . MIT Computer Science and Artificial Intelligence Laboratory. The Angstrom Project. Available from:

5. Modha D. The brains architecture, efficiency on a chip

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3