PERFORMANCE ANALYSIS OF MESH INTERCONNECTION NETWORKS WITH DETERMINISTIC ROUTING

被引:61
作者
ADVE, VS [1 ]
VERNON, MK [1 ]
机构
[1] UNIV WISCONSIN,DEPT COMP SCI,MADISON,WI 53706
基金
美国国家科学基金会;
关键词
APPROXIMATE MEAN VALUE ANALYSIS; CLOSED QUEUING NETWORKS; FINITE BUFFERS; HOT-SPOTS; MULTIPROCESSOR INTERCONNECTION NETWORKS; K-ARY N-CUBE NETWORKS; MESH NETWORKS; NEARNEIGHBOR COMMUNICATION; PERFORMANCE ANALYSIS; WORMHOLE ROUTING;
D O I
10.1109/71.277793
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper develops detailed analytical performance models for k-ary n-cube networks with single-flit or infinite buffers, wormhole routing, and the nonadaptive deadlock-free routing scheme proposed by Dally and Seitz. In contrast to previous performance studies of such networks, the system is modeled as a closed queueing network that 1) includes the effects of blocking and pipelining of messages in the network, 2) allows for arbitrary source-destination probability distributions, and 3) explicitly models the virtual channels used in the deadlock-free routing algorithm. The models are used to examine several performance issues for 2-D networks with shared-memory traffic. Some results obtained are: 1) when processors are allowed to have multiple outstanding requests, system performance is bandwidth-limited, and hence network performance does not scale well with increasing system size; 2) communication locality improves system efficiency, but a very high level of locality is needed in order for system performance to scale well; 3) in contrast to previous hot-spot studies for indirect networks that assume nonblocking processors, this study finds that significant tree-saturation does not occur, even in the presence of severe hot-spots in systems with up to four outstanding requests per processor; and 4) at some plausible system operating points, there is a perceptible difference in the efficiencies of processors at different locations in the mesh because of asymmetric loads on the virtual channels by the deadlock avoidance algorithm. These results should prove useful for engineering high-performance systems based on low-dimensional k-ary n-cube networks.
引用
收藏
页码:225 / 246
页数:22
相关论文
共 23 条
[1]  
AGARWAL A, 1990, 17TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, P104, DOI 10.1109/ISCA.1990.134498
[3]  
BOLDING K, 1992, ADVANCED RESEARCH IN VLSI AND PARALLEL SYSTEMS, P333
[4]  
BOLDING K, 1992, 920707 U WASH DEP CO
[5]  
BORKAR S, 1988, NOV P SUP 88
[6]   LINEARIZER - A HEURISTIC ALGORITHM FOR QUEUING NETWORK MODELS OF COMPUTING SYSTEMS [J].
CHANDY, KM ;
NEUSE, D .
COMMUNICATIONS OF THE ACM, 1982, 25 (02) :126-134
[7]  
CHITTOR S, 1990, 1990 P INT C PAR PRO
[8]  
DALLY WJ, 1987, IEEE T COMPUT, V36, P547, DOI 10.1109/TC.1987.1676939
[9]   PERFORMANCE ANALYSIS OF K-ARY N-CUBE INTERCONNECTION NETWORKS [J].
DALLY, WJ .
IEEE TRANSACTIONS ON COMPUTERS, 1990, 39 (06) :775-785
[10]  
DALLY WJ, 1986, THESIS CAL I TECH