On the energy (in)efficiency of Hadoop clusters-Reference-Cited by-同舟云学术

On the energy (in)efficiency of Hadoop clusters

Published:2010-03-12 Issue:1 Volume:44 Page:61-65
ISSN:0163-5980
Container-title:ACM SIGOPS Operating Systems Review
language:en
Short-container-title:SIGOPS Oper. Syst. Rev.

Author:

Leverich Jacob¹,Kozyrakis Christos¹

Affiliation:

1. Stanford University

Abstract

Distributed processing frameworks, such as Yahoo!'s Hadoop and Google's MapReduce, have been successful at harnessing expansive datacenter resources for large-scale data analysis. However, their effect on datacenter energy efficiency has not been scrutinized. Moreover, the filesystem component of these frameworks effectively precludes scale-down of clusters deploying these frameworks (i.e. operating at reduced capacity). This paper presents our early work on modifying Hadoop to allow scale-down of operational clusters. We find that running Hadoop clusters in fractional configurations can save between 9% and 50% of energy consumption, and that there is a tradeoff between performance energy consumption. We also outline further research into the energy-efficiency of these frameworks.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/1740390.1740405

Reference13 articles.

1. Lustre: A Scalable High Performance File System. http://lustre.org/. Lustre: A Scalable High Performance File System. http://lustre.org/.

2. Apache. Hadoop. http://hadoop.apache.org/. Apache. Hadoop. http://hadoop.apache.org/.

3. The Case for Energy-Proportional Computing

4. Standard Performance Evaluation Corporation. Specpower_ssj2008. http://www.spec.org/power_ssj2008/. Standard Performance Evaluation Corporation. Specpower_ssj2008. http://www.spec.org/power_ssj2008/.

5. MapReduce

Cited by 107 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Energy consumption estimation and profiling for queries in distributed database systems based on a bottom-up comprehensive energy model;Future Generation Computer Systems;2024-10

2. A self-adaptive density-based clustering algorithm for varying densities datasets with strong disturbance factor;Data & Knowledge Engineering;2024-09

3. On Handling Bigdata Efficiently and Securely with Customised Control-An Illustrative Study;2024 International Conference on Advances in Computing, Communication, Electrical, and Smart Systems (iCACCESS);2024-03-08

4. Reducing Cloud Workload Costs in Geographically Distributed Data Centers with GeoSched;2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT);2023-07-06

5. Energy Saving Techniques for Cloud Data Centres: An Empirical Research Analysis;Lecture Notes in Electrical Engineering;2023