Abstract
Distributed processing frameworks, such as Yahoo!'s Hadoop and Google's MapReduce, have been successful at harnessing expansive datacenter resources for large-scale data analysis. However, their effect on datacenter energy efficiency has not been scrutinized. Moreover, the filesystem component of these frameworks effectively precludes scale-down of clusters deploying these frameworks (i.e. operating at reduced capacity). This paper presents our early work on modifying Hadoop to allow scale-down of operational clusters. We find that running Hadoop clusters in fractional configurations can save between 9% and 50% of energy consumption, and that there is a tradeoff between performance energy consumption. We also outline further research into the energy-efficiency of these frameworks.
Publisher
Association for Computing Machinery (ACM)
Reference13 articles.
1. Lustre: A Scalable High Performance File System. http://lustre.org/. Lustre: A Scalable High Performance File System. http://lustre.org/.
2. Apache. Hadoop. http://hadoop.apache.org/. Apache. Hadoop. http://hadoop.apache.org/.
3. The Case for Energy-Proportional Computing
4. Standard Performance Evaluation Corporation. Specpower_ssj2008. http://www.spec.org/power_ssj2008/. Standard Performance Evaluation Corporation. Specpower_ssj2008. http://www.spec.org/power_ssj2008/.
5. MapReduce
Cited by
107 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献