Efficient collective communication in distributed heterogeneous systems

被引:56
作者
Bhat, PB
Raghavendra, CS
Prasanna, VK [1 ]
机构
[1] Univ So Calif, Dept EE Syst, EEB 200C, Los Angeles, CA 90089 USA
[2] Univ So Calif, Dept EE Syst, Aerosp Corp, Los Angeles, CA 90089 USA
[3] Univ So Calif, Dept EE Syst, EEB 246, Los Angeles, CA 90089 USA
关键词
metacomputing; heterogeneous networks; collective communication scheduling; broadcast; multicast;
D O I
10.1016/S0743-7315(03)00008-X
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With recent advances in high-speed networks, distributed heterogeneous computing has emerged as an attractive computational paradigm. Wide-area grid infrastructures will enable distributed applications-such as video conferencing and distributed interactive simulation-to seamlessly integrate collections of heterogeneous workstations, multiprocessors, and mobile nodes. The underlying network is typically a collection of several heterogeneous links, of different networking technologies. Such a heterogeneous network is also typical in local area workstation clusters, which are increasingly being used as alternatives to parallel computing systems. This paper introduces a framework for developing efficient collective communication schedules over such heterogeneous networks. We focus on application-level communication, between processes of a parallel program. Our framework consists of analytical models of the heterogeneous system, scheduling algorithms for the collective communication pattern, and performance evaluation mechanisms. We show that previous models, which considered node heterogeneity but ignored network heterogeneity, can lead to solutions which are worse than the optimal by an unbounded factor. We then introduce an enhanced communication model, and develop three heuristic algorithms for the broadcast and multicast patterns. The completion time of the schedule is chosen as the performance metric. The heuristic algorithms are fastest edge first (FEF), earliest completing edge first (ECEF), and ECEF with look-ahead. For small system sizes, we find the optimal solution using exhaustive search. Our simulation experiments indicate that the performance of our heuristic algorithms is close to optimal. For performance evaluation of larger systems, we have also developed a simple lower bound on the completion time. Our heuristic algorithms achieve significant performance improvements over previous approaches. (C) 2003 Elsevier Science (USA). All rights reserved.
引用
收藏
页码:251 / 263
页数:13
相关论文
共 23 条
[1]  
[Anonymous], 1998, GRID BLUEPRINT NEW C
[2]   CCL - A PORTABLE AND TUNABLE COLLECTIVE COMMUNICATION LIBRARY FOR SCALABLE PARALLEL COMPUTERS [J].
BALA, V ;
BRUCK, J ;
CYPHER, R ;
ELUSTONDO, P ;
HO, A ;
HO, CT ;
KIPNIS, S ;
SNIR, M .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1995, 6 (02) :154-164
[3]   Efficient collective communication on Heterogeneous Networks of Workstations [J].
Banikazemi, M ;
Moorthy, V ;
Panda, DK .
1998 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING - PROCEEDINGS, 1998, :460-467
[4]   Adaptive communication algorithms for distributed heterogeneous systems [J].
Bhat, PB ;
Prasanna, VK ;
Raghavendra, CS .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1999, 59 (02) :252-279
[5]   Efficient message passing interface (MPI) for parallel computing on clusters of workstations [J].
Bruck, J ;
Dolev, D ;
Ho, CT ;
Rosu, MC ;
Strong, R .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1997, 40 (01) :19-34
[6]   Globus: A metacomputing infrastructure toolkit [J].
Foster, I ;
Kesselman, C .
INTERNATIONAL JOURNAL OF SUPERCOMPUTER APPLICATIONS AND HIGH PERFORMANCE COMPUTING, 1997, 11 (02) :115-128
[7]  
FOSTER I, 1998, P SUP C
[8]   EFFICIENT ALGORITHMS FOR FINDING MINIMUM SPANNING-TREES IN UNDIRECTED AND DIRECTED-GRAPHS [J].
GABOW, HN ;
GALIL, Z ;
SPENCER, T ;
TARJAN, RE .
COMBINATORICA, 1986, 6 (02) :109-122
[9]  
JACUNSKI M, 1999, P COMM ARCH SUPP NET
[10]   Exploiting multiple heterogeneous networks to reduce communication costs in parallel programs [J].
Kim, J ;
Lilja, DJ .
SIXTH HETEROGENEOUS COMPUTING WORKSHOP (HCW '97), PROCEEDINGS, 1997, :83-95