Dynamo

Author:

Wu Qiang1,Deng Qingyuan1,Ganesh Lakshmi1,Hsu Chang-Hong2,Jin Yun1,Kumar Sanjeev1,Li Bin1,Meza Justin1,Song Yee Jiun1

Affiliation:

1. Facebook, Inc.

2. University of Michigan

Abstract

Data center power is a scarce resource that often goes underutilized due to conservative planning. This is because the penalty for overloading the data center power delivery hierarchy and tripping a circuit breaker is very high, potentially causing long service outages. Recently, dynamic server power capping, which limits the amount of power consumed by a server, has been proposed and studied as a way to reduce this penalty, enabling more aggressive utilization of provisioned data center power. However, no real at-scale solution for data center-wide power monitoring and control has been presented in the literature. In this paper, we describe Dynamo -- a data center-wide power management system that monitors the entire power hierarchy and makes coordinated control decisions to safely and efficiently use provisioned data center power. Dynamo has been developed and deployed across all of Facebook's data centers for the past three years. Our key insight is that in real-world data centers, different power and performance constraints at different levels in the power hierarchy necessitate coordinated data center-wide power management. We make three main contributions. First, to understand the design space of Dynamo, we provide a characterization of power variation in data centers running a diverse set of modern workloads. This characterization uses fine-grained power samples from tens of thousands of servers and spanning a period of over six months. Second, we present the detailed design of Dynamo. Our design addresses several key issues not addressed by previous simulation-based studies. Third, the proposed techniques and design have been deployed and evaluated in large scale data centers serving billions of users. We present production results showing that Dynamo has prevented 18 potential power outages in the past 6 months due to unexpected power surges; that Dynamo enables optimizations leading to a 13% performance boost for a production Hadoop cluster and a nearly 40% performance increase for a search cluster; and that Dynamo has already enabled an 8% increase in the power capacity utilization of one of our data centers with more aggressive power subscription measures underway.

Publisher

Association for Computing Machinery (ACM)

Reference34 articles.

1. Power provisioning for a warehouse-sized computer

2. J. Hamilton "Internet-scale Service Infrastructure Efficiency " ISCA Keynote 2009. 10.1145/1555754.1555756 J. Hamilton "Internet-scale Service Infrastructure Efficiency " ISCA Keynote 2009. 10.1145/1555754.1555756

3. Ensemble-level Power Management for Dense Blade Servers

Cited by 22 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Can Storage Devices be Power Adaptive?;Proceedings of the 16th ACM Workshop on Hot Topics in Storage and File Systems;2024-07-08

2. Impact of power consumption in containerized clouds: A comprehensive analysis of open-source power measurement tools;Computer Networks;2024-05

3. Expanding Datacenter Capacity with DVFS Boosting: A safe and scalable deployment experience;Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 1;2024-04-17

4. An End-to-End HPC Framework for Dynamic Power Objectives;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12

5. DCMigrationALG: A Power-Aware Data Center Container Migration Algorithm;Journal of Physics: Conference Series;2023-08-01

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3