On the energy (In)efficiency of Hadoop clusters

被引:41
作者
Leverich J. [1 ]
Kozyrakis C. [1 ]
机构
[1] Computer Systems Laboratory, Stanford University
来源
Operating Systems Review (ACM) | 2010年 / 44卷 / 01期
关键词
D O I
10.1145/1740390.1740405
中图分类号
学科分类号
摘要
Distributed processing frameworks, such as Yahoo!'s Hadoop and Google's MapReduce, have been successful at harnessing expansive datacenter resources for large-scale data analysis. However, their effect on datacenter energy efficiency has not been scrutinized. Moreover, the filesystem component of these frameworks effectively precludes scale-down of clusters deploying these frameworks (i.e. operating at reduced capacity). This paper presents our early work on modifying Hadoop to allow scale-down of operational clusters. We find that running Hadoop clusters in fractional configurations can save between 9% and 50% of energy consumption, and that there is a tradeoff between performance energy consumption. We also outline further research into the energy-efficiency of these frameworks.
引用
收藏
页码:61 / 65
页数:4
相关论文
共 13 条
  • [1] Lustre: A Scalable, High Performance File System
  • [2] Apache. Hadoop
  • [3] Andre Barroso L., Holzle U., The case for energy-proportional computing, Computer, 40, 12, (2007)
  • [4] Standard Performance Evaluation Corporation
  • [5] Dean J., Ghemawat S., Map reduce: Simplified data processing on large clusters, Commun. ACM, (2008)
  • [6] Fay C., Et al., Bigtable: A Distributed Storage System for Structured Data, (2006)
  • [7] Fan X., Weber W., Barroso L.A., Power Provisioning for A Warehouse-sized Computer, (2007)
  • [8] Ghemawat S., Gobioff H., Leung S., The google file system, SIGOPS Oper. Syst. Rev., (2003)
  • [9] Intelligent Platform Management Interface
  • [10] Meisner D., Gold B.T., Wenisch T.F., PowerNap: Eliminating Server Idle Power, (2009)