DaeMon: Architectural Support for Efficient Data Movement in Fully Disaggregated Systems

Author:

Giannoula Christina1ORCID,Huang Kailong2ORCID,Tang Jonathan2ORCID,Koziris Nectarios3ORCID,Goumas Georgios3ORCID,Chishti Zeshan4ORCID,Vijaykumar Nandita2ORCID

Affiliation:

1. University of Toronto & National Technical University of Athens, Toronto, ON, Canada

2. University of Toronto, Toronto, ON, Canada

3. National Technical University of Athens, Athens, Greece

4. Intel Corporation, Portland, OR, USA

Abstract

Resource disaggregation offers a cost effective solution to resource scaling, utilization, and failure-handling in data centers by physically separating hardware devices in a server. Servers are architected as pools of processor, memory, and storage devices, organized as independent failure-isolated components interconnected by a high-bandwidth network. A critical challenge, however, is the high performance penalty of accessing data from a remote memory module over the network. Addressing this challenge is difficult as disaggregated systems have high runtime variability in network latencies/bandwidth, and page migration can significantly delay critical path cache line accesses in other pages.This paper conducts a characterization analysis on different data movement strategies in fully disaggregated systems, evaluates their performance overheads in a variety of workloads, and introduces DaeMon, the first software-transparent mechanism to significantly alleviate data movement overheads in fully disaggregated systems. First, to enable scalability to multiple hardware components in the system, we enhance each compute and memory unit with specialized engines that transparently handle data migrations. Second, to achieve high performance and provide robustness across various network, architecture and application characteristics, we implement a synergistic approach of bandwidth partitioning, link compression, decoupled data movement of multiple granularities, and adaptive granularity selection in data movements. We evaluate DaeMon in a wide variety of workloads at different network and architecture configurations using a state-of-the-art simulator. DaeMon improves system performance and data access costs by 2.39× and 3.06×, respectively, over the widely-adopted approach of moving data at page granularity.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture,Safety, Risk, Reliability and Quality,Computer Science (miscellaneous)

Reference114 articles.

1. B. Abali H. Franke D. E. Poff R. A. Saccone C. O. Schulz L. M. Herger and T. B. Smith. 2001. Memory Expansion Technology (MXT): Software Support and Performance. IBM Journal of Research and Development (2001). B. Abali H. Franke D. E. Poff R. A. Saccone C. O. Schulz L. M. Herger and T. B. Smith. 2001. Memory Expansion Technology (MXT): Software Support and Performance. IBM Journal of Research and Development (2001).

2. Atul Adya Robert Grandl Daniel Myers and Henry Qin. 2019. Fast Key-Value Stores: An Idea Whose Time Has Come and Gone. In HotOS. Atul Adya Robert Grandl Daniel Myers and Henry Qin. 2019. Fast Key-Value Stores: An Idea Whose Time Has Come and Gone. In HotOS.

3. Neha Agarwal , David Nellans , Mark Stephenson , Mike O'Connor , and Stephen W . Keckler . 2015 . Page Placement Strategies for GPUs within Heterogeneous Memory Systems. In ASPLOS. Neha Agarwal, David Nellans, Mark Stephenson, Mike O'Connor, and Stephen W. Keckler. 2015. Page Placement Strategies for GPUs within Heterogeneous Memory Systems. In ASPLOS.

4. Neha Agarwal and Thomas F . Wenisch . 2017 . Thermostat : Application-Transparent Page Management for Two-Tiered Main Memory. In ASPLOS. Neha Agarwal and Thomas F. Wenisch. 2017. Thermostat: Application-Transparent Page Management for Two-Tiered Main Memory. In ASPLOS.

5. Marcos K. Aguilera , Nadav Amit , Irina Calciu , Xavier Deguillard , Jayneel Gandhi , Stanko Novaković , Arun Ramanathan , Pratap Subrahmanyam , Lalith Suresh , Kiran Tati , Rajesh Venkatasubramanian , and Michael Wei . 2018 . Remote Regions: A Simple Abstraction for Remote Memory. In ATC. Marcos K. Aguilera, Nadav Amit, Irina Calciu, Xavier Deguillard, Jayneel Gandhi, Stanko Novaković, Arun Ramanathan, Pratap Subrahmanyam, Lalith Suresh, Kiran Tati, Rajesh Venkatasubramanian, and Michael Wei. 2018. Remote Regions: A Simple Abstraction for Remote Memory. In ATC.

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Morpheus: An Adaptive DRAM Cache with Online Granularity Adjustment for Disaggregated Memory;2023 IEEE 41st International Conference on Computer Design (ICCD);2023-11-06

2. Architectural Support for Efficient Data Movement in Fully Disaggregated Systems;ACM SIGMETRICS Performance Evaluation Review;2023-06-26

3. Architectural Support for Efficient Data Movement in Fully Disaggregated Systems;Abstract Proceedings of the 2023 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems;2023-06-19

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3