High performance direct gravitational N-body simulations on graphics processing units II:: An implementation in CUDA

被引:123
作者
Belleman, Robert G. [1 ]
Bedorf, Jeroen
Portegies Zwart, Simon
机构
[1] Univ Amsterdam, Sect Computat Sci, Amsterdam, Netherlands
[2] Univ Amsterdam, Astron Inst Anton Pannekoek, Amsterdam, Netherlands
关键词
gravitation; stellar dynamics; methods : N-body simulation; methods : numerical;
D O I
10.1016/j.newast.2007.07.004
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
We present the results of gravitational direct N-body simulations using the graphics processing unit (GPU) on a commercial NVIDIA GeForce 8800GTX designed for gaming computers. The force evaluation of the N-body problem is implemented in "Compute Unified Device Architecture" (CUDA) using the GPU to speedup the calculations. We tested the implementation on three different N-body codes: two direct N-body integration codes, using the 4th order predictor-corrector Hermite integrator with block time-steps, and one Barnes-Hut treecode, which uses a 2nd order leapfrog integration scheme. The integration of the equations of motions for all codes is performed on the host CPU. We find that for N > 512 particles the GPU outperforms the GRAPE-6Af, if some softening in the force calculation is accepted. Without softening and for very small integration time-steps the GRAPE still outperforms the GPU. We conclude that modern GPUs offer an attractive alternative to GRAPE-6Af special purpose hardware. Using the same time-step criterion, the total energy of the N-body system was conserved better than to one in 10(6) on the GPU, only about an order of magnitude worse than obtained with GRAPE-6Af. For N greater than or similar to 10(5) the 8800GTX outperforms the host CPU by a factor of about 100 and runs at about the same speed as the GRAPE-6Af. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:103 / 112
页数:10
相关论文
共 26 条
  • [1] DYNAMICAL EVOLUTION OF CLUSTERS OF GALAXIES .1.
    AARSETH, SJ
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 1963, 126 (03) : 223 - 255
  • [2] [Anonymous], 1965, ELECTRONICS
  • [3] [Anonymous], 1998, SCI SIMULATIONS SPEC
  • [4] A HIERARCHICAL O(N-LOG-N) FORCE-CALCULATION ALGORITHM
    BARNES, J
    HUT, P
    [J]. NATURE, 1986, 324 (6096) : 446 - 449
  • [5] Brook for GPUs: Stream computing on graphics hardware
    Buck, I
    Foley, T
    Horn, D
    Sugerman, J
    Fatahalian, K
    Houston, M
    Hanrahan, P
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2004, 23 (03): : 777 - 786
  • [6] ELSEN E, 2007, ARXIV ASTROPHYS
  • [7] FERNANDO R, 2004, GPU GEMS PROGRAMMING, V4
  • [8] HAMADA T, 2007, ARXIV ASTROPHYSICS
  • [9] Harris M., 2005, SIGGRAPH 2005 GPGPU COURSE, P1
  • [10] HEGGIE DC, 1986, LECT NOT PHYS, V267