Disaggregated memory for expansion and sharing in blade servers

Author:

Lim Kevin1,Chang Jichuan2,Mudge Trevor1,Ranganathan Parthasarathy2,Reinhardt Steven K.3,Wenisch Thomas F.1

Affiliation:

1. University of Michigan, Ann Arbor, MI, USA

2. Hewlett-Packard Labs, Palo Alto, CA, USA

3. Advanced Micro Devices, Inc., Bellevue, USA

Abstract

Analysis of technology and application trends reveals a growing imbalance in the peak compute-to-memory-capacity ratio for future servers. At the same time, the fraction contributed by memory systems to total datacenter costs and power consumption during typical usage is increasing. In response to these trends, this paper re-examines traditional compute-memory co-location on a single system and details the design of a new general-purpose architectural building block-a memory blade-that allows memory to be "disaggregated" across a system ensemble. This remote memory blade can be used for memory capacity expansion to improve performance and for sharing memory across servers to reduce provisioning and power costs. We use this memory blade building block to propose two new system architecture solutions-(1) page-swapped remote memory at the virtualization layer, and (2) block-access remote memory with support in the coherence hardware-that enable transparent memory expansion and sharing on commodity-based systems. Using simulations of a mix of enterprise benchmarks supplemented with traces from live datacenters, we demonstrate that memory disaggregation can provide substantial performance benefits (on average 10X) in memory constrained environments, while the sharing enabled by our solutions can improve performance-per-dollar by up to 57% when optimizing memory provisioning across multiple servers.

Publisher

Association for Computing Machinery (ACM)

Reference41 articles.

1. VMWare Performance Team Blogs. Ten Reasons Why Oracle Databases Run Best on VMWare "Scale up with Large Memory." http://tinyurl.com/cudjuy VMWare Performance Team Blogs. Ten Reasons Why Oracle Databases Run Best on VMWare "Scale up with Large Memory." http://tinyurl.com/cudjuy

2. SIA. International Technology Roadmap for Semiconductors 2007 Edition 2007. SIA. International Technology Roadmap for Semiconductors 2007 Edition 2007.

3. HP. Memory technology evolution: an overview of system memory technologies. http://tinyurl.com/ctfjs2 HP. Memory technology evolution: an overview of system memory technologies. http://tinyurl.com/ctfjs2

4. Power aware page allocation

Cited by 188 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. An Introduction to the Compute Express Link (CXL) Interconnect;ACM Computing Surveys;2024-07-08

2. AStore: Uniformed Adaptive Learned Index and Cache for RDMA-Enabled Key-Value Store;IEEE Transactions on Knowledge and Data Engineering;2024-07

3. Optimizing LSM-based indexes for disaggregated memory;The VLDB Journal;2024-06-19

4. DistMind: Efficient Resource Disaggregation for Deep Learning Workloads;IEEE/ACM Transactions on Networking;2024-06

5. Evaluating the potential of disaggregated memory systems for HPC applications;Concurrency and Computation: Practice and Experience;2024-05-31

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3