FAULT-TOLERANT WORMHOLE ROUTING ALGORITHMS FOR MESH NETWORKS

被引:208
作者
BOPPANA, RV [1 ]
CHALASANI, S [1 ]
机构
[1] UNIV WISCONSIN,DEPT ELECT & COMP ENGN,MADISON,WI 53706
基金
美国国家科学基金会;
关键词
ADAPTIVE ROUTING; BLOCK FAULTS; DEADLOCKS; FAULT-TOLERANT ROUTING; MESH NETWORKS; MULTICOMPUTER NETWORKS; PERFORMANCE EVALUATION; WORMHOLE ROUTING;
D O I
10.1109/12.392844
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We present simple methods to enhance the current minimal wormhole routing algorithms developed for high-radix, low-dimensional mesh networks for fault-tolerant routing, We consider arbitrarily-located faulty blocks and assume only local knowledge of faults, Messages are routed minimally when not blocked by faults and this constraint is relaxed to route around faults, The key concept we use is a fault ring consisting of fault-free nodes and links can be formed around each fault region, Our fault-tolerant techniques use these fault rings to route messages around fault regions, We show that, using just one extra virtual channel per physical channel, the well-known e-cube algorithm can be used to provide deadlock-free routing in networks with nonoverlapping fault rings; there is no restriction on the number of faults, For the more complex faults with overlapping fault rings, four virtual channels are used, We also prove that at most four additional virtual channels are sufficient to make fully-adaptive algorithms tolerant to multiple faulty blocks in n-dimensional meshes, All these algorithms are deadlock- and livelock-free. Further, we present simulation results for the e-cube and a fully-adaptive algorithm fortified with our fault-tolerant routing techniques and show that good performance may be obtained with as many as 10% links faulty.
引用
收藏
页码:848 / 864
页数:17
相关论文
共 33 条
[1]  
AGARWAL A, 1991, P WORKSHOP SCALABLE
[2]  
BHUYAN LN, 1984, IEEE T COMPUTERS, V33
[3]  
BOLDING K, 1991, 1991 P IEEE INT WORK, P124
[4]  
BOPPANA RV, 1993, 20TH P ANN INT S COM, P351
[5]  
Borkar S., 1988, Proceedings. Supercomputing '88 (IEEE Cat. No.88CH2617-9), P330, DOI 10.1109/SUPERC.1988.44670
[6]  
CHALASANI S, 1994, 8TH P ACM INT C SUP
[7]  
Chen M.-S., 1990, IEEE Transactions on Parallel and Distributed Systems, V1, P152, DOI 10.1109/71.80143
[8]  
CHIEN AA, 1992, 19TH P ANN INT S COM, P268
[9]  
DALLY W, 1990, VLSI PARALLEL COMPUT, P140
[10]  
DALLY WJ, 1987, IEEE T COMPUT, V36, P547, DOI 10.1109/TC.1987.1676939