MPICH-G2: A Grid-enabled implementation of the Message Passing Interface

被引:284
作者
Karonis, NT [1 ]
Toonen, B
Foster, I
机构
[1] No Illinois Univ, Dept Comp Sci, De Kalb, IL 60115 USA
[2] Argonne Natl Lab, Div Math & Comp Sci, Argonne, IL 60439 USA
[3] Univ Chicago, Chicago, IL 60637 USA
关键词
MPI; Grid computing; message passing; Globus Toolkit; MPICH-G2;
D O I
10.1016/S0743-7315(03)00002-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Application development for distributed-computing "Grids" can benefit from tools that variously hide or enable application-level management of critical aspects of the heterogeneous environment. As part of an investigation of these issues, we have developed MPICH-G2, a Grid-enabled implementation of the Message Passing Interface (MPI) that allows a user to run MPI programs across multiple computers, at the same or different sites, using the same commands that would be used on a parallel computer. This library extends the Argonne MPICH implementation of MPI to use services provided by the Globus Toolkit for authentication, authorization, resource allocation, executable staging, and I/O, as well as for process creation, monitoring, and control. Various performance-critical operations, including startup and collective operations, are configured to exploit network topology information. The library also exploits MPI constructs for performance management; for example, the MPI communicator construct is used for application-level discovery of, and adaptation to, both network topology and network quality-of-service mechanisms. We describe the MPICH-G2 design and implementation, present performance results, and review application experiences, including record-setting distributed simulations. (C) 2003 Elsevier Science (USA). All rights reserved.
引用
收藏
页码:551 / 563
页数:13
相关论文
共 44 条
[1]  
ABRAMSON D, 1995, P 4 IEEE S HIGH PERF
[2]  
ALLEN G, 2001, P SUP 2001
[3]  
BARNETT M, 1994, J PARALLEL DISTRIBUT, V22, P324
[4]  
BARYNOY A, 1992, P 4 ANN ACM S PAR AL, P559
[5]  
BESTER J, 1999, P IOPADS 99
[6]  
Burns G., 1994, P SUPERCOMPUTING S, P379
[7]  
CASANOVA H, 1995, CS95313 U TENN
[8]  
CHEN J, 1998, P 7 IEEE S HIGH PERF
[9]  
CULLER D, 1993, P 4 ACM SIGPLAN S PR, P1
[10]  
Czajkowski K, 1998, 4 WORKSH JOB SCHED S