Joint Optimization of Idle and Cooling Power in Data Centers While Maintaining Response Time

被引:74
作者
Ahmad, Faraz [1 ]
Vijaykumar, T. N. [1 ]
机构
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
基金
美国国家科学基金会;
关键词
Design; Measurement; Performance; data center; power management; idle power; cooling power; response time;
D O I
10.1145/1735971.1736048
中图分类号
TP31 [计算机软件];
学科分类号
081205 [计算机软件];
摘要
Server power and cooling power amount to a significant fraction of modern data centers' recurring costs. While data centers provision enough servers to guarantee response times under the maximum loading, data centers operate under much less loading most of the times (e.g., 30-70% of the maximum loading). Previous server-power proposals exploit this under-utilization to reduce the server idle power by keeping active only as many servers as necessary and putting the rest into low-power standby modes. However, these proposals incur higher cooling power due to hot spots created by concentrating the data center loading on fewer active servers, or degrade response times due to standby-to-active transition delays, or both. Other proposals optimize the cooling power but incur considerable idle power. To address the first issue of power, we propose PowerTrade, which trades-off idle power and cooling power for each other, thereby reducing the total power. To address the second issue of response time, we propose SurgeGuard to overprovision the number of active servers beyond that needed by the current loading so as to absorb future increases in the loading. SurgeGuard is a two-tier scheme which uses well-known over-provisioning at coarse time granularities (e.g., one hour) to absorb the common, smooth increases in the loading, and a novel fine-grain replenishment of the over-provisioned reserves at fine time granularities (e.g., five minutes) to handle the uncommon, abrupt loading surges. Using real-world traces, we show that combining PowerTrade and SurgeGuard reduces total power by 30% compared to previous low-power schemes while maintaining response times within 1.7%.
引用
收藏
页码:243 / 256
页数:14
相关论文
共 33 条
[1]
Allen O, 1990, PROBABILITY STAT QUE
[2]
[Anonymous], The Linux Documentation Project
[3]
[Anonymous], 2001, WORKSH COMP OP SYST
[4]
*ANS INC, COMP FLUID DYN CFD S
[5]
The case for energy-proportional computing [J].
Barroso, Luiz Andre ;
Hoelzle, Urs .
COMPUTER, 2007, 40 (12) :33-+
[6]
Belady C., 2007, WHITE PAPER METRICS
[7]
Bohrer P, 2002, S COMP SCI, P261
[8]
Bolch G., 1998, Queuing Networks and Markov Chains
[9]
Chase J. S., 2001, S OP SYST PRINC, P103
[10]
Chen Y., 2005, SIGMETRICS Perform. Eval. Rev, V33, P303, DOI DOI 10.1145/1071690.1064253