Implementing molecular dynamics on hybrid high performance computers - short range forces

被引:554
作者
Brown, W. Michael [1 ]
Wang, Peng [2 ]
Plimpton, Steven J. [3 ]
Tharrington, Arnold N. [1 ]
机构
[1] Oak Ridge Natl Lab, Natl Ctr Computat Sci, Oak Ridge, TN 37831 USA
[2] NVIDIA, Santa Clara, CA USA
[3] Sandia Natl Labs, Albuquerque, NM USA
关键词
Molecular dynamics; GPU; Hybrid parallel computing; SIMULATIONS; EFFICIENT; ACCELERATOR; ALGORITHMS;
D O I
10.1016/j.cpc.2010.12.021
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The use of accelerators such as graphics processing units (GPUs) has become popular in scientific computing applications due to their low cost, impressive floating-point capabilities, high memory bandwidth, and low electrical power requirements. Hybrid high-performance computers, machines with more than one type of floating-point processor, are now becoming more prevalent due to these advantages. In this work, we discuss several important issues in porting a large molecular dynamics code for use on parallel hybrid machines - (1) choosing a hybrid parallel decomposition that works on central processing units (CPUs) with distributed memory and accelerator cores with shared memory, (2) minimizing the amount of code that must be ported for efficient acceleration, (3) utilizing the available processing power from both multi-core CPUs and accelerators, and (4) choosing a programming model for acceleration. We present our solution to each of these issues for short-range force calculation in the molecular dynamics package LAMMPS, however, the methods can be applied in many molecular dynamics codes. Specifically, we describe algorithms for efficient short range force calculation on hybrid high-performance machines. We describe an approach for dynamic load balancing of work between CPU and accelerator cores. We describe the Geryon library that allows a single code to compile with both CUDA and OpenCL for use on a variety of accelerators. Finally, we present results on a parallel test cluster containing 32 Fermi GPUs and 180 CPU cores. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:898 / 911
页数:14
相关论文
共 34 条
[1]   General purpose molecular dynamics simulations fully implemented on graphics processing units [J].
Anderson, Joshua A. ;
Lorenz, Chris D. ;
Travesset, A. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2008, 227 (10) :5342-5359
[2]   A GENERALIZED GAY-BERNE INTERMOLECULAR POTENTIAL FOR BIAXIAL PARTICLES [J].
BERARDI, R ;
FAVA, C ;
ZANNONI, C .
CHEMICAL PHYSICS LETTERS, 1995, 236 (4-5) :462-468
[3]   Liquid crystal nanodroplets in solution [J].
Brown, W. Michael ;
Petersen, Matt K. ;
Plimpton, Steven J. ;
Grest, Gary S. .
JOURNAL OF CHEMICAL PHYSICS, 2009, 130 (04)
[4]  
Davis JE, 2009, LECT N BIOINFORMAT, V5462, P176, DOI 10.1007/978-3-642-00727-9_18
[5]   Efficient Nonbonded Interactions for Molecular Dynamics on a Graphics Processing Unit [J].
Eastman, Peter ;
Pande, Vijay S. .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2010, 31 (06) :1268-1272
[6]   Accelerating Molecular Dynamic Simulation on Graphics Processing Units [J].
Friedrichs, Mark S. ;
Eastman, Peter ;
Vaidyanathan, Vishal ;
Houston, Mike ;
Legrand, Scott ;
Beberg, Adam L. ;
Ensign, Daniel L. ;
Bruns, Christopher M. ;
Pande, Vijay S. .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2009, 30 (06) :864-872
[7]   MODIFICATION OF THE OVERLAP POTENTIAL TO MIMIC A LINEAR SITE-SITE POTENTIAL [J].
GAY, JG ;
BERNE, BJ .
JOURNAL OF CHEMICAL PHYSICS, 1981, 74 (06) :3316-3319
[8]   The impact of accelerator processors for high-throughput molecular modeling and simulation [J].
Giupponi, G. ;
Harvey, M. J. ;
De Fabritiis, G. .
DRUG DISCOVERY TODAY, 2008, 13 (23-24) :1052-1058
[9]   A new version of the fast multipole method for screened Coulomb interactions in three dimensions [J].
Greengard, LF ;
Huang, JF .
JOURNAL OF COMPUTATIONAL PHYSICS, 2002, 180 (02) :642-658
[10]  
Hampton S, 2010, P 2010 INT C HIGH PE