Improving System Energy Efficiency with Memory Rank Subsetting

Author:

Ahn Jung Ho1,Jouppi Norman P.2,Kozyrakis Christos3,Leverich Jacob3,Schreiber Robert S.2

Affiliation:

1. Seoul National University

2. Hewlett-Packard Labs

3. Stanford University

Abstract

VLSI process technology scaling has enabled dramatic improvements in the capacity and peak bandwidth of DRAM devices. However, current standard DDR x DIMM memory interfaces are not well tailored to achieve high energy efficiency and performance in modern chip-multiprocessor-based computer systems. Their suboptimal performance and energy inefficiency can have a significant impact on system-wide efficiency since much of the system power dissipation is due to memory power. New memory interfaces, better suited for future many-core systems, are needed. In response, there are recent proposals to enhance the energy efficiency of main-memory systems by dividing a memory rank into subsets, and making a subset rather than a whole rank serve a memory request. We holistically assess the effectiveness of rank subsetting from system-wide performance, energy-efficiency, and reliability perspectives. We identify the impact of rank subsetting on memory power and processor performance analytically, compare two promising rank-subsetting proposals, Multicore DIMM and mini-rank, and verify our analysis by simulating a chip-multiprocessor system using multithreaded and consolidated workloads. We extend the design of Multicore DIMM for high-reliability systems and show that compared with conventional chipkill approaches, rank subsetting can lead to much higher system-level energy efficiency and performance at the cost of additional DRAM devices. This holistic assessment shows that rank subsetting offers compelling alternatives to existing processor-memory interfaces for future DDR systems.

Funder

Ministry of Education, Science and Technology

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Information Systems,Software

Reference55 articles.

1. Architecture---The design space of data-parallel memory systems

2. Multicore DIMM: an Energy Efficient Memory Module with Independently Controlled DRAMs

3. Future scaling of processor-memory interfaces

4. AMD. 2007. BIOS and Kernel Developer’s Guide for AMD NPT Family 0Fh Processors. http://www.amd.com/us-en/assets/content_type/white_papers_and_tech_docs/32559.pdf. AMD . 2007. BIOS and Kernel Developer’s Guide for AMD NPT Family 0Fh Processors . http://www.amd.com/us-en/assets/content_type/white_papers_and_tech_docs/32559.pdf.

5. The Price of Performance

Cited by 18 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Smart Devices and SRAM: Analyzing Their Impact;2023 International Conference on Sustainable Emerging Innovations in Engineering and Technology (ICSEIET);2023-09-14

2. RBC: A memory architecture for improved performance and energy efficiency;Tsinghua Science and Technology;2021-06

3. FIGARO: Improving System Performance via Fine-Grained In-DRAM Data Relocation and Caching;2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO);2020-10

4. Survey on memory management techniques in heterogeneous computing systems;IET Computers & Digital Techniques;2020-01-21

5. Quantifying Memory Underutilization in HPC Systems and Using it to Improve Performance via Architecture Support;Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture;2019-10-12

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3