The performance of a supercomputer built with commodity components

被引:6
作者
Deng, YF [1 ]
Korobka, A [1 ]
机构
[1] SUNY Stony Brook, Ctr Comp Sci, Stony Brook, NY 11794 USA
基金
美国国家科学基金会;
关键词
Intel Pentium-based cluster; network computing; NAS and LINPACK; build your own supercomputer;
D O I
10.1016/S0167-8191(00)00090-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We built a supercomputer called Galaxy by connecting Intel Pentium-based computer nodes with Fast and Gigabit Ethernet switches. Each node has two processors at clock speeds varying from 300 to 600 MHz, up to 512 MB of memory, and small 2 Gb local disk. All nodes run the standard RedHat Linux and inter-node communication is handled by a message passing interface called MPI. Local tools are written to visualize the system performance and to balance loads. We have benchmarked a sub-Galaxy with 72 processors by NAS and Parallel LINPACK benchmark suites. We achieved 16.9 Gflops in a standard single precision LU decomposition for 46848 x 46838 matrix parallel LINPACK benchmark. A Galaxy with 128 processors costs approximately $250 000 and it delivers 40 Gflops of performance. This leads to a cost-performance ratio of 160 Kflops-per-dollar, which is to improve further due to increase in processor speeds and network bandwidth at similar cost. Our final system with 512 processors is expected to reach several Tflops. This article first describes the Galaxy architectural details, and then present and analyze its performance in terms of floating point number crunching, network bandwidth, and IO throughput. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:91 / 108
页数:18
相关论文
共 8 条
[1]  
BECKER DJ, 1995, P 1995 INT C PAR PRO, P11
[2]   Parallel simulated annealing by mixing of states [J].
Chu, KW ;
Deng, YF ;
Reinitz, J .
JOURNAL OF COMPUTATIONAL PHYSICS, 1999, 148 (02) :646-662
[3]   Prediction of protein binding to DNA in the presence of water-mediated hydrogen bonds [J].
Deng, YF ;
Glimm, J ;
Wang, Y ;
Korobka, A ;
Eisenberg, M ;
Grollman, AP .
JOURNAL OF MOLECULAR MODELING, 1999, 5 (7-8) :125-133
[4]  
GLIMM J, 1981, ADV APPL MATH, V2, P91, DOI 10.1016/0196-8858(81)90040-3
[5]   Parallel particle simulations of thin-film deposition [J].
McCoy, RA ;
Deng, YF .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 1999, 13 (01) :16-32
[6]  
SALTZ D, 1999, UNPUB INT J MULTI PH
[7]  
SNELL Q, 1996, IASTED INT C INT INF
[8]  
STORAGE COMPUTER RAI