Managing Data Transfers in Computer Clusters with Orchestra

Cited by: 372
Authors
Chowdhury, Mosharaf [1]
Zaharia, Matei [1]
Ma, Justin [1]
Jordan, Michael I. [1]
Stoica, Ion [1]
Affiliation
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
Funding
Natural Sciences and Engineering Research Council of Canada; National Science Foundation (USA)
Keywords
Algorithms; design; performance; data-intensive applications; data transfer; datacenter networks
DOI
10.1145/2043164.2018448
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Cluster computing applications like MapReduce and Dryad transfer massive amounts of data between their computation stages. These transfers can have a significant impact on job performance, accounting for more than 50% of job completion times. Despite this impact, there has been relatively little work on optimizing the performance of these data transfers, with networking researchers traditionally focusing on per-flow traffic management. We address this limitation by proposing a global management architecture and a set of algorithms that (1) improve the transfer times of common communication patterns, such as broadcast and shuffle, and (2) allow scheduling policies at the transfer level, such as prioritizing a transfer over other transfers. Using a prototype implementation, we show that our solution improves broadcast completion times by up to 4.5x compared to the status quo in Hadoop. We also show that transfer-level scheduling can reduce the completion time of high-priority transfers by 1.7x.
Pages: 98-109
Number of pages: 12