On the energy (in)efficiency of Hadoop clusters

Author:

Leverich Jacob1,Kozyrakis Christos1

Affiliation:

1. Stanford University

Abstract

Distributed processing frameworks, such as Yahoo!'s Hadoop and Google's MapReduce, have been successful at harnessing expansive datacenter resources for large-scale data analysis. However, their effect on datacenter energy efficiency has not been scrutinized. Moreover, the filesystem component of these frameworks effectively precludes scale-down of clusters deploying these frameworks (i.e. operating at reduced capacity). This paper presents our early work on modifying Hadoop to allow scale-down of operational clusters. We find that running Hadoop clusters in fractional configurations can save between 9% and 50% of energy consumption, and that there is a tradeoff between performance energy consumption. We also outline further research into the energy-efficiency of these frameworks.

Publisher

Association for Computing Machinery (ACM)

Reference13 articles.

1. Lustre: A Scalable High Performance File System. http://lustre.org/. Lustre: A Scalable High Performance File System. http://lustre.org/.

2. Apache. Hadoop. http://hadoop.apache.org/. Apache. Hadoop. http://hadoop.apache.org/.

3. The Case for Energy-Proportional Computing

4. Standard Performance Evaluation Corporation. Specpower_ssj2008. http://www.spec.org/power_ssj2008/. Standard Performance Evaluation Corporation. Specpower_ssj2008. http://www.spec.org/power_ssj2008/.

5. MapReduce

Cited by 107 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Energy consumption estimation and profiling for queries in distributed database systems based on a bottom-up comprehensive energy model;Future Generation Computer Systems;2024-10

2. A self-adaptive density-based clustering algorithm for varying densities datasets with strong disturbance factor;Data & Knowledge Engineering;2024-09

3. On Handling Bigdata Efficiently and Securely with Customised Control-An Illustrative Study;2024 International Conference on Advances in Computing, Communication, Electrical, and Smart Systems (iCACCESS);2024-03-08

4. Reducing Cloud Workload Costs in Geographically Distributed Data Centers with GeoSched;2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT);2023-07-06

5. Energy Saving Techniques for Cloud Data Centres: An Empirical Research Analysis;Lecture Notes in Electrical Engineering;2023

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3