共 82 条
Density-fitted singles and doubles coupled cluster on graphics processing units
被引:56
作者:
DePrince, A. Eugene, III
[1
,2
]
Kennedy, Matthew R.
[2
]
Sumpter, Bobby G.
[3
,4
]
Sherrill, C. David
[2
,5
]
机构:
[1] Florida State Univ, Dept Chem & Biochem, Tallahassee, FL 32306 USA
[2] Georgia Inst Technol, Ctr Computat Mol Sci & Technol, Sch Chem & Biochem, Atlanta, GA 30332 USA
[3] Oak Ridge Natl Lab, Comp Sci & Math Div, Oak Ridge, TN USA
[4] Oak Ridge Natl Lab, Ctr Nanophase Mat Sci, Oak Ridge, TN USA
[5] Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA 30332 USA
基金:
美国国家科学基金会;
关键词:
graphics processing units;
singles doubles coupled cluster;
density fitting;
FROZEN NATURAL ORBITALS;
QUANTUM-CHEMISTRY CALCULATIONS;
PLESSET PERTURBATION-THEORY;
OPTIMIZED VIRTUAL ORBITALS;
CORRELATED CALCULATIONS;
BASIS-SETS;
MULTICORE PARALLELIZATION;
CHOLESKY DECOMPOSITIONS;
APPROXIMATE INTEGRALS;
2-ELECTRON INTEGRALS;
D O I:
10.1080/00268976.2013.874599
中图分类号:
O64 [物理化学(理论化学)、化学物理学];
学科分类号:
070304 ;
081704 ;
摘要:
We adapt an algorithm for singles and doubles coupled cluster (CCSD) that uses density fitting or Cholesky decomposition (CD) in the construction and contraction of all electron repulsion integrals (ERIs) for use on heterogeneous compute nodes consisting of a multicore central processing unit (CPU) and at least one graphics processing unit (GPU). The use of approximate three-index ERIs ameliorates two of the major difficulties in designing scientific algorithms for GPUs: (1) the extremely limited global memory on the devices and (2) the overhead associated with data motion across the bus. For the benzene trimer described by an aug-cc-pVDZ basis set, the use of a single NVIDIA Tesla C2070 (Fermi) GPU accelerates a CD-CCSD computation by a factor of 2.1, relative to the multicore CPU-only algorithm that uses six highly efficient Intel Core i7-3930K CPU cores. The use of two Fermi GPUs provides an acceleration of 2.89, which is comparable to that observed when using a single NVIDIA Kepler K20c GPU (2.73).
引用
收藏
页码:844 / 852
页数:9
相关论文