Tiling nested loops into maximal rectangular blocks

被引:7
作者
Chen, YS
Wang, SD
Wang, CM
机构
[1] NATL TAIWAN UNIV,DEPT ELECT ENGN,TAIPEI 106,TAIWAN
[2] ACAD SINICA,INST INFORMAT SCI,TAIPEI 115,TAIWAN
关键词
D O I
10.1006/jpdc.1996.0075
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, an approach to thing nested loops for maximizing parallelism is proposed, The proposed method aims at aggregating independent computations of a loop nest into rectangular blocks and maximizing the block sizes for maximizing parallelism, At first, all the independent computations that can be executed in the first time unit are identified. These computations are called the initially independent computations, Then it is shown that all of them can be collected as a union of rectangular blocks, So, based on these, the entire iteration space of the loops is partitioned into rectangular blocks for maximizing parallelism, The proposed method is formulated as systematic procedures which can easily be implemented in a parallelizing compiler, It is shown that when the wavefront transformation is combined with the proposed method, the loops can always be tiled so that the tile size is greater than one. In comparison with previous work on thing, the proposed method is shown to have several advantages as summarized in the conclusions of this paper. (C) 1996 Academic Press, Inc.
引用
收藏
页码:123 / 132
页数:10
相关论文
共 27 条
[1]  
ANCOURT C, 1991, 3RD P ACM SIGPLAN S, P39
[2]  
BANERJEE U, 1988, DEPENDENCE ANAL SUPE
[3]   CONSTRUCTIVE METHODS FOR SCHEDULING UNIFORM LOOP NESTS [J].
DARTE, A ;
ROBERT, Y .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1994, 5 (08) :814-822
[4]   PARTITIONING AND LABELING OF LOOPS BY UNIMODULAR TRANSFORMATIONS [J].
DHOLLANDER, EH .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1992, 3 (04) :465-476
[5]  
FORTES JAB, 1984, AUG P INT C PAR PROC, P322
[6]   STRATEGIES FOR CACHE AND LOCAL MEMORY MANAGEMENT BY GLOBAL PROGRAM TRANSFORMATION [J].
GANNON, D ;
JALBY, W ;
GALLIVAN, K .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1988, 5 (05) :587-616
[7]  
IRIGOIN F, 1988, 15TH P ANN ACM SIGAC, P319
[8]  
IRIGOIN F, 1987, THESIS U PARIS 6 PAR
[9]  
KING C, 1989, P INT C PAR PROC, V2, P31
[10]   NUMBER OF OPERATIONS SIMULTANEOUSLY EXECUTABLE IN FORTRAN-LIKE PROGRAMS AND THEIR RESULTING SPEEDUP [J].
KUCK, DJ ;
MURAOKA, Y ;
CHEN, SC .
IEEE TRANSACTIONS ON COMPUTERS, 1972, C 21 (12) :1293-1310