A PARALLEL BLOCK IMPLEMENTATION OF LEVEL-3 BLAS FOR MIMD VECTOR PROCESSORS

被引:8
作者
DAYDE, MJ [1 ]
DUFF, IS [1 ]
PETITET, A [1 ]
机构
[1] CERFACS,F-31057 TOULOUSE,FRANCE
来源
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE | 1994年 / 20卷 / 02期
关键词
LEVEL-3; BLAS; MATRIX-MATRIX KERNELS; PARALLELIZATION; VECTORIZATION;
D O I
10.1145/178365.174413
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We describe an implementation of Level-3 BLAS (Basic Linear Algebra Subprograms) based on the use of the matrix-matrix multiplication kernel (GEMM). Blocking techniques are used to express the BLAS in terms of operations involving triangular blocks and calls to GEMM. A principal advantage of this approach is that most manufacturers provide at least an efficient serial version of GEMM so that our implementation can capture a significant percentage of the computer performance. A parameter which controls the blocking allows an efficient exploitation of the memory hierarchy of the various target computers. Furthermore, this blocked version of Level-3 BLAS is naturally parallel. We present results on the ALLIANT FX/80, the CONVEX C220, the CRAY-2, and the IBM 3090/VF. For GEMM, we always use the manufacturer-supplied versions. For the operations dealing with triangular blocks, we use assembler or tuned Fortran (using loop-unrolling) codes, depending on the efficiency of the available libraries.
引用
收藏
页码:178 / 193
页数:16
相关论文
共 26 条
[1]  
AMESTOY PR, 1992, CERFACS TRPA9269 REP
[2]  
Anderson E., 1992, LAPACK USERS GUIDE
[3]  
BERGER P, 1991, CERFACS TRPA9154 REP
[4]  
Bischof C, 1987, SIAM J SCI STAT COMP, V8, P2
[5]  
Calahan D. A., 1986, Proceedings of the 1986 International Conference on Parallel Processing (Cat. No.86CH2355-6), P375
[6]   LEVEL-3 BLAS IN LU FACTORIZATION ON THE CRAY-2, ETA-10P, AND IBM 3090-200 VF [J].
DAYDE, MJ ;
DUFF, IS .
INTERNATIONAL JOURNAL OF SUPERCOMPUTER APPLICATIONS AND HIGH PERFORMANCE COMPUTING, 1989, 3 (02) :40-70
[7]  
DAYDE MJ, 1990, INT J SUPERCOMPUTER, V5, P92
[8]   IMPLEMENTING LINEAR ALGEBRA ALGORITHMS FOR DENSE MATRICES ON A VECTOR PIPELINE MACHINE [J].
DONGARRA, JJ ;
GUSTAVSON, FG ;
KARP, A .
SIAM REVIEW, 1984, 26 (01) :91-112
[9]   AN EXTENDED SET OF FORTRAN BASIC LINEAR ALGEBRA SUBPROGRAMS [J].
DONGARRA, JJ ;
DUCROZ, J ;
HAMMARLING, S ;
HANSON, RJ .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1988, 14 (01) :1-17
[10]   AN EXTENDED SET OF BASIC LINEAR ALGEBRA SUBPROGRAMS - MODEL IMPLEMENTATION AND TEST PROGRAMS [J].
DONGARRA, JJ ;
DUCROZ, J ;
HAMMARLING, S ;
HANSON, RJ .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1988, 14 (01) :18-32