Optimizing task layout on the Blue Gene/L supercomputer

被引:35
作者
Bhanot, G
Gara, A
Heidelberger, P
Lawless, E
Sexton, JC
Walkup, R
机构
[1] IBM Corp, Div Res, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Univ Dublin Trinity Coll, Ctr High Performance Comp, Dublin 2, Ireland
关键词
D O I
10.1147/rd.492.0489
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A general method for optimizing problem layout on the Blue Gene((R))/L (BG/L) supercomputer is described. The method takes as input the communication matrix of an arbitrary problem as an array with entries C(i.j.), which represents the data communicated from domain i to domain j. Gh,en C(i, j), we implement a heuristic map that attempts to sequentially map a domain and its communication neighbors either to the same BG/L node or to nearneighbor nodes on the BG/L torus, while keeping the number of domains mapped to a BG/L node constant. We then generate a A,Markov, chain of maps using Monte Carlo simulation with free energy F = Sigma(ij) C(i,j) H(i, j), where H(i, j) is the smallest number of hops on the BG/L torus between domain i and domain j. For two large parallel applications, SAGE and UMT2000, the method was tested against the default Message Passing Interface rank order layout on up to 2,048 BG/L nodes. It produced maps that improved communication efficiency by up to 45%.
引用
收藏
页码:489 / 500
页数:12
相关论文
共 13 条
[1]  
[Anonymous], 2002, P ACM IEEE C SUPERCO
[2]  
[Anonymous], 1987, SIMULATED ANNEALING
[3]  
*ASCI, PURL BENCHM
[4]   The BlueGene/L supercomputer [J].
Bhanot, G ;
Chen, D ;
Gara, A ;
Vranas, P .
NUCLEAR PHYSICS B-PROCEEDINGS SUPPLEMENTS, 2003, 119 :114-121
[5]   AN EFFICIENT ALGORITHM FOR A TASK ALLOCATION PROBLEM [J].
BILLIONNET, A ;
COSTA, MC ;
SUTTER, A .
JOURNAL OF THE ACM, 1992, 39 (03) :502-518
[6]  
Bremaud P., 1999, MARKOV CHAINS GIBBS
[7]   A NEW MAPPING HEURISTIC BASED ON MEAN FIELD ANNEALING [J].
BULTAN, T ;
AYKANAT, C .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1992, 16 (04) :292-305
[8]  
Dixit-Radiya V. A., 1993, Proceedings of the Fifth IEEE Symposium on Parallel and Distributed Processing (Cat. No.93TH0584-3), P674, DOI 10.1109/SPDP.1993.395467
[9]   TASK ALLOCATION ONTO A HYPERCUBE BY RECURSIVE MINCUT BIPARTITIONING [J].
ERCAL, F ;
RAMANUJAM, J ;
SADAYAPPAN, P .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1990, 10 (01) :35-44
[10]  
Kerbyson D. J., 2001, P ACM IEEE C SUP, P1, DOI [10.1145/582034.582071, DOI 10.1145/582034.582071]