IMPLEMENTATION OF THE LEVEL-2 AND LEVEL-3 BLAS ON THE CRAY Y-MP AND THE CRAY-2

被引:3
作者
SHEIKH, Q [1 ]
VU, PO [1 ]
YANG, C [1 ]
MERCHANT, M [1 ]
机构
[1] RICE UNIV,MATH SCI,HOUSTON,TX 77251
关键词
BLAS; CODE OPTIMIZATION; CRAY ARCHITECTURE; VECTOR PROCESSING;
D O I
10.1007/BF00127950
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we discuss code optimization techniques for implementing the Level 2 and 3 basic linear algebra subprograms on a single processor for the CRAY Y-MP and the CRAY-2. Our performance measurements show that the use of these techniques leads to a significant improvement in performance, and most subroutines achieve close to the peak performance of the machine for computations of relatively small sizes.
引用
收藏
页码:291 / 305
页数:15
相关论文
共 11 条
[1]   IMPLEMENTING LINEAR ALGEBRA ALGORITHMS FOR DENSE MATRICES ON A VECTOR PIPELINE MACHINE [J].
DONGARRA, JJ ;
GUSTAVSON, FG ;
KARP, A .
SIAM REVIEW, 1984, 26 (01) :91-112
[2]   AN EXTENDED SET OF FORTRAN BASIC LINEAR ALGEBRA SUBPROGRAMS [J].
DONGARRA, JJ ;
DUCROZ, J ;
HAMMARLING, S ;
HANSON, RJ .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1988, 14 (01) :1-17
[3]   AN EXTENDED SET OF BASIC LINEAR ALGEBRA SUBPROGRAMS - MODEL IMPLEMENTATION AND TEST PROGRAMS [J].
DONGARRA, JJ ;
DUCROZ, J ;
HAMMARLING, S ;
HANSON, RJ .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1988, 14 (01) :18-32
[4]  
DONGARRA JJ, 1990, ACM T MATH SOFTWARE, V16, P1, DOI 10.1145/77626.79170
[5]  
DONGARRA JJ, 1984, ACM T MATH SOFTWARE, V10, P221
[6]  
FONG K, 1977, LA6774 LOS AL NAT LA
[7]  
Lawson C. L., 1979, ACM Transactions on Mathematical Software, V5, P324, DOI [10.1145/355841.355847, 10.1145/355841.355848]
[8]  
SHEIKH Q, 1989, CRAY CHANNELS, P24
[9]  
SHEIKH Q, 1989, IMPLEMENTATION LEVEL
[10]  
SHEIKH Q, 1989, 2ND C VECT PAR PROC