A general parallel sparse-blocked matrix multiply for linear scaling SCF theory

被引:78
作者
Challacombe, M [1 ]
机构
[1] Los Alamos Natl Lab, Div Theoret, Grp T12, Los Alamos, NM 87545 USA
关键词
D O I
10.1016/S0010-4655(00)00074-6
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A general approach to the parallel sparse-blocked matrix-matrix multiply is developed in the context of linear scaling self-consistent-field (SCF) theory. The data-parallel message passing method uses non-blocking communication to overlap computation and communication. The space filling curve heuristic is used to achieve data locality for sparse matrix elements that decay with "separation". Load balance is achieved by solving the bin packing problem for blocks with variable size. With this new method as the kernel, parallel performance of the simplified density matrix minimization (SDMM) for solution of the SCF equations is investigated for RHF/6-31G** water clusters and RHF/3-21G estane globules. Sustained rates above 5.7 GFLOPS for the SDMM have been achieved for (H2O)(200) with 95 Origin 2000 processors. Scalability is found to be limited by load imbalance, which increases with decreasing granularity, due primarily to the inhomogeneous distribution of variable block sizes. Published by Elsevier Science B.V.
引用
收藏
页码:93 / 107
页数:15
相关论文
共 106 条