GRAPE-6A: A single-card GRAPE-6 for parallel PC-GRAPE cluster systems

Cited by: 79
Authors
Fukushige, T [1]
Makino, J
Kawai, A
Affiliations
[1] Univ Tokyo, Coll Arts & Sci, Dept Gen Syst Studies, Tokyo 1538902, Japan
[2] Univ Tokyo, Sch Sci, Dept Astron, Tokyo 1330033, Japan
Keywords
galaxies: star clusters; methods: n-body simulations; stellar dynamics
DOI
10.1093/pasj/57.6.1009
Chinese Library Classification number
P1 [Astronomy]
Subject classification code
0704
Abstract
In this paper, we describe the design and performance of GRAPE-6A, a special-purpose computer for gravitational many-body simulations. It was designed to be used with a PC cluster, in which each node has one GRAPE-6A. Such a configuration is particularly cost-effective in running parallel tree algorithms. Though the use of parallel tree algorithms was possible with the original GRAPE-6 hardware, it was not very cost-effective since a single GRAPE-6 board was still too fast and too expensive. Therefore, we designed GRAPE-6A as a single PCI card to minimize the reproduction cost and to optimize the computing speed. The peak performance is 130 Gflops for one GRAPE-6A board and 3.1 Tflops for our 24-node cluster. We describe the implementation of the tree, TreePM, and individual timestep algorithms on both a single GRAPE-6A system and a GRAPE-6A cluster. Using the tree algorithm on our 16-node GRAPE-6A system, we can complete a collisionless simulation with 100 million particles (8000 steps) within 10 days.
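The figures quoted in the abstract can be cross-checked with simple arithmetic. The sketch below is not taken from the paper; the function names are hypothetical, and it assumes that cluster peak performance scales linearly with the number of boards (one board per node, as the abstract states) and that "within 10 days" is treated as an upper bound on wall-clock time.

```python
# Back-of-the-envelope check of the abstract's figures.
# Assumptions (not from the paper): linear scaling of peak speed with
# board count, and a 10-day upper bound on total wall-clock time.

SECONDS_PER_DAY = 86400.0


def cluster_peak_tflops(gflops_per_board: float, nodes: int) -> float:
    """Aggregate peak speed in Tflops, assuming one board per node."""
    return gflops_per_board * nodes / 1000.0


def max_seconds_per_step(days: float, steps: int) -> float:
    """Average wall-clock budget per step if the run finishes within `days`."""
    return days * SECONDS_PER_DAY / steps


if __name__ == "__main__":
    # 130 Gflops per board, 24 nodes
    print(f"peak: {cluster_peak_tflops(130.0, 24):.2f} Tflops")
    # 8000 steps within 10 days
    print(f"budget: {max_seconds_per_step(10, 8000):.0f} s per step")
```

Under these assumptions the 24-node peak comes to 3.12 Tflops, consistent with the quoted 3.1 Tflops, and the 100-million-particle run on the 16-node system corresponds to an average of at most 108 seconds per step.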
Pages: 1009-1021
Number of pages: 13