VECTORIZING MATRIX OPERATIONS ARISING FROM PDE DISCRETIZATION ON 9-POINT STENCILS

被引:3
作者
BLOM, JG
VERWER, JG
机构
[1] CWI, Amsterdam, 1090 GB
关键词
VECTORIZATION; NONSYMMETRIC SPARSE LINEAR SYSTEMS; NONRECTANGULAR DOMAIN; MATRIX-VECTOR MULTIPLICATION; ILU PRECONDITIONING; HYPERPLANE METHOD;
D O I
10.1007/BF01666907
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
When solving a system of PDEs, discretized on 9-point stencils over a nonrectangular domain, the linear systems that arise will have matrices with an irregular block structure. In this paper we discuss the vectorization of the matrix-vector multiply and of the Incomplete LU factorization and backsolve for these types of matrices. The performance of the matrix-vector multiply is already optimal for a small number of grid points (one result per clock cycle). For the ILU factorization and backsolve the vector performance is not as satisfying, partly because the resulting vector length is generally small and partly because of the heavy use of indirect addressing. A comparison with the general-purpose routines from the SLAP library shows a significant gain in computational time.
引用
收藏
页码:29 / 51
页数:23
相关论文
共 15 条
[1]   ON VECTORIZING INCOMPLETE FACTORIZATION AND SSOR PRECONDITIONERS [J].
ASHCRAFT, CC ;
GRIMES, RG .
SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1988, 9 (01) :122-151
[2]  
BLOM JG, 1993, INT J COMPUT FLUID D, V1, P95
[3]   DISTRIBUTION OF MATHEMATICAL SOFTWARE VIA ELECTRONIC MAIL [J].
DONGARRA, JJ ;
GROSSE, E .
COMMUNICATIONS OF THE ACM, 1987, 30 (05) :403-407
[4]  
FUJINO S, 1991, APR P IMACS INT S IT, P349
[5]  
Gustafsson I., 1978, BIT (Nordisk Tidskrift for Informationsbehandling), V18, P142, DOI 10.1007/BF01931691
[6]  
SAAD Y, 1986, SIAM J SCI STAT COMP, V7, P856, DOI 10.1137/0907058
[7]   CGS, A FAST LANCZOS-TYPE SOLVER FOR NONSYMMETRIC LINEAR-SYSTEMS [J].
SONNEVELD, P .
SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1989, 10 (01) :36-52
[8]   COMPUTING BRINE TRANSPORT IN POROUS-MEDIA WITH AN ADAPTIVE-GRID METHOD [J].
TROMPERT, RA ;
VERWER, JG ;
BLOM, JG .
INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 1993, 16 (01) :43-63
[9]   ANALYSIS OF THE IMPLICIT EULER LOCAL UNIFORM GRID REFINEMENT METHOD [J].
TROMPERT, RA ;
VERWER, JG .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1993, 14 (02) :259-278
[10]  
TROMPERT RA, 1993, MATH COMPUT, V60, P591, DOI 10.1090/S0025-5718-1993-1181332-3