On the energy (In)efficiency of Hadoop clusters

被引：41

作者：

Leverich J. ^{[1
]}

Kozyrakis C. ^{[1
]}

机构：

[1] Computer Systems Laboratory, Stanford University

来源：

Operating Systems Review (ACM) | 2010年 / 44卷 / 01期

关键词：

D O I：

10.1145/1740390.1740405

中图分类号：

学科分类号：

摘要：

Distributed processing frameworks, such as Yahoo!'s Hadoop and Google's MapReduce, have been successful at harnessing expansive datacenter resources for large-scale data analysis. However, their effect on datacenter energy efficiency has not been scrutinized. Moreover, the filesystem component of these frameworks effectively precludes scale-down of clusters deploying these frameworks (i.e. operating at reduced capacity). This paper presents our early work on modifying Hadoop to allow scale-down of operational clusters. We find that running Hadoop clusters in fractional configurations can save between 9% and 50% of energy consumption, and that there is a tradeoff between performance energy consumption. We also outline further research into the energy-efficiency of these frameworks.

引用

页码：61 / 65

页数：4

共 13 条

[1] Lustre: A Scalable, High Performance File System
[2] Apache. Hadoop
[3] Andre Barroso L., Holzle U., The case for energy-proportional computing, Computer, 40, 12, (2007)
[4] Standard Performance Evaluation Corporation
[5] Dean J., Ghemawat S., Map reduce: Simplified data processing on large clusters, Commun. ACM, (2008)
[6] Fay C., Et al., Bigtable: A Distributed Storage System for Structured Data, (2006)
[7] Fan X., Weber W., Barroso L.A., Power Provisioning for A Warehouse-sized Computer, (2007)
[8] Ghemawat S., Gobioff H., Leung S., The google file system, SIGOPS Oper. Syst. Rev., (2003)
[9] Intelligent Platform Management Interface
[10] Meisner D., Gold B.T., Wenisch T.F., PowerNap: Eliminating Server Idle Power, (2009)

← 1 2 →