Adaptive communication algorithms for distributed heterogeneous systems

被引:16
作者
Bhat, PB
Prasanna, VK
Raghavendra, CS
机构
[1] Univ So Calif, Dept Elect Engn Syst, Los Angeles, CA 90089 USA
[2] Aerosp Corp, Los Angeles, CA 90009 USA
关键词
collective communication; open shop scheduling; heterogeneous networks; metacomputing; adaptive communication scheduling; graph matching;
D O I
10.1006/jpdc.1999.1571
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many grand challenge applications can benefit from metacomputing, i.e., the coordinated use of,geographically distributed heterogeneous supercomputers. A salient feature of such systems is the heterogeneity in the network performance between different processor pairs. This paper considers the problem of efficient application-level communication in heterogeneous network-based systems. We present a uniform communication scheduling framework for developing adaptive communication schedules for various collective communication patterns. The Framework enables schedules to be developed at runtime, based on network performance information obtained from a directory service. Based on this framework: we have developed communication schedules for the total exchange communication pattern. Our first algorithm develops a schedule by computing a series of matchings in a bipartite graph. We also present a heuristic algorithm based on the open shop scheduling problem. The completion time of the heuristic is guaranteed to be within twice the optimal. Simulation results show performance improvements by a factor of 5 over well-known homogeneous scheduling techniques. This paper is an early effort in formalizing and solving communication problems for metacomputing systems. We discuss several research issues that must be addressed to allow efficient collective communication in such environments. (C) 1999 Academic Press.
引用
收藏
页码:252 / 279
页数:28
相关论文
共 22 条
[1]   CCL - A PORTABLE AND TUNABLE COLLECTIVE COMMUNICATION LIBRARY FOR SCALABLE PARALLEL COMPUTERS [J].
BALA, V ;
BRUCK, J ;
CYPHER, R ;
ELUSTONDO, P ;
HO, A ;
HO, CT ;
KIPNIS, S ;
SNIR, M .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1995, 6 (02) :154-164
[2]  
BHAT PB, 1999, THESIS U SO CALIFORN
[3]  
BHAT PB, 1998, P ISCA INT C PAR DIS, P242
[4]  
BHAT PB, 1999, P IEEE INT C DISTR C
[5]   CONSTRUCTIVE HEURISTIC ALGORITHMS FOR THE OPEN SHOP PROBLEM [J].
BRASEL, H ;
TAUTENHAHN, T ;
WERNER, F .
COMPUTING, 1993, 51 (02) :95-110
[6]  
Brucker P., 1995, SCHEDULING ALGORITHM
[7]  
DEWITT T, 1997, CMUCS97194 SCH COMP
[8]   A directory service for configuring high-performance distributed computations [J].
Fitzgerald, S ;
Foster, I ;
Kesselman, C ;
vonLaszewski, G ;
Smith, W ;
Tuecke, S .
SIXTH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, PROCEEDINGS, 1997, :365-375
[9]   Globus: A metacomputing infrastructure toolkit [J].
Foster, I ;
Kesselman, C .
INTERNATIONAL JOURNAL OF SUPERCOMPUTER APPLICATIONS AND HIGH PERFORMANCE COMPUTING, 1997, 11 (02) :115-128
[10]  
GONZALEZ T, 1976, J ACM, V23, P665, DOI 10.1145/321978.321985