STABILITY OF BLOCK ALGORITHMS WITH FAST LEVEL-3 BLAS

被引:38
作者
DEMMEL, JW
HIGHAM, NJ
机构
[1] UNIV CALIF BERKELEY,DIV COMP SCI,BERKELEY,CA 94720
[2] UNIV MANCHESTER,DEPT MATH,MANCHESTER M13 9PL,LANCS,ENGLAND
来源
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE | 1992年 / 18卷 / 03期
关键词
BACKWARD ERROR ANALYSIS; BLOCK ALGORITHM; ITERATIVE REFINEMENT; LAPACK; LEVEL-3; BLAS; LU FACTORIZATION; QR FACTORIZATION;
D O I
10.1145/131766.131769
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Block algorithms are becoming increasingly popular in matrix computations. Since their basic unit of data is a submatrix rather than a scalar, they have a higher level of granularity than point algorithms, and this makes them well suited to high-performance computers. The numerical stability of the block algorithms in the new linear algebra program library LAPACK is investigated here. It is shown that these algorithms have backward error analyses in which the backward error bounds are commensurate with the error bounds for the underlying level-3 BLAS (BLAS3). One implication is that the block algorithms are as stable as the corresponding point algorithms when conventional BLAS3 are used. A second implication is that the use of BLAS3 based on fast matrix multiplication techniques affects the stability only insofar as it increases the constant terms in the normwise backward error bounds. For linear equation solvers employing LU factorization, it is shown that fixed precision iterative refinement helps to mitigate the effect of the larger error constants. Despite the positive results presented here, not all plausible block algorithms are stable; we illustrate this with the example of LU factorization with block triangular factors and describe how to check a block algorithm for stability without doing a full error analysis.
引用
收藏
页码:274 / 291
页数:18
相关论文
共 36 条
[1]   SOLVING SPARSE LINEAR-SYSTEMS WITH SPARSE BACKWARD ERROR [J].
ARIOLI, M ;
DEMMEL, JW ;
DUFF, IS .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1989, 10 (02) :165-190
[2]  
Bai Z., 1989, International Journal of High Speed Computing, V1, P97, DOI 10.1142/S0129053389000068
[3]   USING STRASSEN ALGORITHM TO ACCELERATE THE SOLUTION OF LINEAR-SYSTEMS [J].
BAILEY, DH ;
LEE, K ;
SIMON, HD .
JOURNAL OF SUPERCOMPUTING, 1991, 4 (04) :357-371
[4]   THE WY REPRESENTATION FOR PRODUCTS OF HOUSEHOLDER MATRICES [J].
BISCHOF, C ;
VANLOAN, C .
SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1987, 8 (01) :S2-S13
[5]  
BISCHOF CH, 1989, MCSP1050989 ANL MATH
[7]  
Dahlquist G., 1974, NUMERICAL METHODS
[8]  
DEMMEL JW, 1988, LAPACK4 ANL MATH COM
[9]  
DEMMEL JW, 1987, ANL97 MATH COMP SCI
[10]  
DEMMEL JW, 1989, LAPACK14 U TENN DEP