USING PROCESSOR AFFINITY IN LOOP SCHEDULING ON SHARED-MEMORY MULTIPROCESSORS

被引:85
作者
MARKATOS, EP [1 ]
LEBLANC, TJ [1 ]
机构
[1] UNIV ROCHESTER,DEPT COMP SCI,ROCHESTER,NY 14627
基金
美国国家科学基金会;
关键词
LOOP SCHEDULING; SYNCHRONIZATION; LOAD IMBALANCE; COMMUNICATION OVERHEAD; SHARED-MEMORY MULTIPROCESSORS;
D O I
10.1109/71.273046
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Loops are the single largest source of parallelism in many applications. One way to exploit this parallelism is to execute loop iterations in parallel on different processors. Previous approaches to loop scheduling attempted to achieve the minimum completion time by distributing the workload as evenly as possible while minimizing the number of synchronization operations required. In this paper, we consider a third dimension to the problem of loop scheduling on shared-memory multiprocessors: communication overhead caused by accesses to nonlocal data. We show that traditional algorithms for loop scheduling, which ignore the location of data when assigning iterations to processors, incur a significant performance penalty on modern shared-memory multiprocessors. We propose a new loop scheduling algorithm that attempts to simultaneously balance the workload, minimize synchronization, and co-locate loop iterations with the necessary data. We compare the performance of this new algorithm to other known algorithms by using five representative kernel programs on a Silicon Graphics multiprocessor work-station, a BBN Butterfly, a Sequent Symmetry, and a KSR-1, and show that the new algorithm offers substantial performance improvements, up to a factor of 4 in some cases. We conclude that loop scheduling algorithms for shared-memory multiprocessors cannot afford to ignore the location of data, particularly in light of the increasing disparity between processor and memory speeds.
引用
收藏
页码:379 / 400
页数:22
相关论文
共 32 条
[1]   THE PERFORMANCE IMPLICATIONS OF THREAD MANAGEMENT ALTERNATIVES FOR SHARED-MEMORY MULTIPROCESSORS [J].
ANDERSON, TE ;
LAZOWSKA, ED ;
LEVY, HM .
IEEE TRANSACTIONS ON COMPUTERS, 1989, 38 (12) :1631-1644
[2]  
BERSHAD BN, 1988, 1988 P ACM SIGPLAN P, P1
[3]  
Bokhari SH., 1987, ASSIGNMENT PROBLEMS
[4]  
BOLOSKY W, 1989, 12TH P ACM S OP SYST, P19
[5]  
BOLOSKY WJ, 1991, 4TH P INT C ARCH SUP, P212
[6]  
COX AL, 1989, 12TH P ACM S OP SYST, P32
[7]  
CROVELLA M, 1991, 3RD P IEEE S PAR DIS, P500
[8]  
DANDAMUDI S, 1991, 3RD P IEEE S PAR DIS, P423
[9]  
DOEPPNER TW, 1987, CS8711 BROWN U DEP C
[10]  
Eager D. L., 1992, 920101 U WASH DEP CO