Mesh partitioning for efficient use of distributed systems

被引:12
作者
Chen, J [1 ]
Taylor, VE [1 ]
机构
[1] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA
关键词
mesh partitioning; simulated annealing; distributed systems;
D O I
10.1109/71.980027
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Mesh partitioning for homogeneous systems has been studied extensively; however, mesh partitioning for distributed systems is a relatively new area of research. To ensure efficient execution on a distributed system, the heterogeneities in the processor and network performance must be taken into consideration in the partitioning process; equal size subdomains and small cut set size, which results from conventional mesh partitioning, are no longer the primary goals. In this paper, we address various issues related to mesh partitioning for distributed systems. These issues include the metric used to compare different partitions, efficiency of the application executing on a distributed system, and the advantage, of exploiting heterogeneity in network performance. We present a tool called PART, for automatic mesh partitioning for distributed systems. The novel feature of PART is that it considers heterogeneities in the application and the distributed system. Simulated annealing is used in PART to perform the backtracking search for desired partitions. While it is well-known that simulated annealing is computationally intensive, we describe the parallel version of simulated annealing that is used with PART. The results of the parallelization exhibit superlinear speedup in most cases and nearly perfect speedup for the remaining cases. Experimental results are also presented for partitioning regular and irregular finite element meshes for an explicit, nonlinear finite element application, called WHAMS2D, executing on a distributed system consisting of two IBM SPs with different processors. The results from the regular problems indicate a:33 to 46 percent increase in efficiency when processor performance is considered as compared to the conventional even partitioning. The results indicate a 5 to 15 percent increase in efficiency when network performance is considered as compared to considering only processor performance; this is significant given that the optimal improvement is 15 percent for this: application. The results from the irregular problem indicate up to 36 percent increase in efficiency when processor and network performance are considered as compared to even partitioning.
引用
收藏
页码:67 / 79
页数:13
相关论文
共 40 条
[1]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
[2]  
[Anonymous], 1993, MULTILEVEL ALGORITHM
[3]  
BANERJEE P, 1990, IEEE T PAR DISTR SYS, V1
[4]  
BARNARD ST, 1993, FAST MULTILEVEL IMPL
[5]  
BERGER MJ, 1987, IEEE T COMPUTERS, V36
[6]   An evaluation of parallel simulated annealing strategies with application to standard cell placement [J].
Chandy, JA ;
Kim, S ;
Ramkumar, B ;
Parkes, S ;
Banerjee, P .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1997, 16 (04) :398-410
[7]  
CHEN HC, 1991, 1112 CSRD U ILL
[8]  
Chen J, 1998, P 7 IEEE INT S HIGH
[9]  
CHEN J, 1999, P 6 INT WORKSH SOLV
[10]  
CRANALL PE, 1994, P ACM 1994 COMP SCI