Implementing molecular dynamics on hybrid high performance computers - Particle-particle particle-mesh

被引:393
作者
Brown, W. Michael [1 ]
Kohlmeyer, Axel [2 ]
Plimpton, Steven J. [3 ]
Tharrington, Arnold N. [1 ]
机构
[1] Oak Ridge Natl Lab, Natl Ctr Computat Sci, Oak Ridge, TN 37831 USA
[2] Temple Univ, Inst Computat Mol Sci, Philadelphia, PA 19122 USA
[3] Sandia Natl Labs, Albuquerque, NM USA
基金
美国国家科学基金会;
关键词
Molecular dynamics; Electrostatics; Particle mesh; GPU; Hybrid parallel computing; EWALD SUMS;
D O I
10.1016/j.cpc.2011.10.012
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The use of accelerators such as graphics processing units (GPUs) has become popular in scientific computing applications due to their low cost, impressive floating-point capabilities, high memory bandwidth, and low electrical power requirements. Hybrid high-performance computers, machines with nodes containing more than one type of floating-point processor (e.g. CPU and GPU), are now becoming more prevalent due to these advantages. In this paper, we present a continuation of previous work implementing algorithms for using accelerators into the LAMMPS molecular dynamics software for distributed memory parallel hybrid machines. In our previous work, we focused on acceleration for short-range models with an approach intended to harness the processing power of both the accelerator and (multi-core) CPUs. To augment the existing implementations, we present an efficient implementation of long-range electrostatic force calculation for molecular dynamics. Specifically, we present an implementation of the particle-particle particle-mesh method based on the work by Harvey and De Fabritiis. We present benchmark results on the Keeneland InfiniBand GPU cluster. We provide a performance comparison of the same kernels compiled with both CUDA and OpenCL. We discuss limitations to parallel efficiency and future directions for improving performance on hybrid or heterogeneous computers. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:449 / 459
页数:11
相关论文
共 20 条
[1]   Implementing molecular dynamics on hybrid high performance computers - short range forces [J].
Brown, W. Michael ;
Wang, Peng ;
Plimpton, Steven J. ;
Tharrington, Arnold N. .
COMPUTER PHYSICS COMMUNICATIONS, 2011, 182 (04) :898-911
[2]  
Danalis A., 2010, P 3 WORKSH GEN PURP, P63, DOI [10.1145/1735688.1735702, DOI 10.1145/1735688.1735702]
[3]   PARTICLE MESH EWALD - AN N.LOG(N) METHOD FOR EWALD SUMS IN LARGE SYSTEMS [J].
DARDEN, T ;
YORK, D ;
PEDERSEN, L .
JOURNAL OF CHEMICAL PHYSICS, 1993, 98 (12) :10089-10092
[4]   How to mesh up Ewald sums. II. An accurate error estimate for the particle-particle-particle-mesh algorithm [J].
Deserno, M ;
Holm, C .
JOURNAL OF CHEMICAL PHYSICS, 1998, 109 (18) :7694-7701
[5]   How to mesh up Ewald sums. I. A theoretical and numerical comparison of various particle mesh routines [J].
Deserno, M ;
Holm, C .
JOURNAL OF CHEMICAL PHYSICS, 1998, 109 (18) :7678-7693
[6]   A SMOOTH PARTICLE MESH EWALD METHOD [J].
ESSMANN, U ;
PERERA, L ;
BERKOWITZ, ML ;
DARDEN, T ;
LEE, H ;
PEDERSEN, LG .
JOURNAL OF CHEMICAL PHYSICS, 1995, 103 (19) :8577-8593
[7]  
Ewald PP, 1921, ANN PHYS-BERLIN, V64, P253
[8]   Is the Ewald summation still necessary? Pairwise alternatives to the accepted standard for long-range electrostatics [J].
Fennell, Christopher J. ;
Gezelter, J. Daniel .
JOURNAL OF CHEMICAL PHYSICS, 2006, 124 (23)
[9]  
Ganesan N., 2011, IEEE INT PAR DISTR P, P467
[10]   Multilevel summation of electrostatic potentials using graphics processing units [J].
Hardy, David J. ;
Stone, John E. ;
Schulten, Klaus .
PARALLEL COMPUTING, 2009, 35 (03) :164-177