Exploring the VLSI scalability of stream processors

被引:15
作者
Khailany, B [1 ]
Dally, WJ [1 ]
Rixner, S [1 ]
Kapasi, UJ [1 ]
Owens, JD [1 ]
Towles, B [1 ]
机构
[1] Stanford Univ, Comp Syst Lab, Stanford, CA 94305 USA
来源
NINTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS | 2003年
关键词
D O I
10.1109/HPCA.2003.1183534
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Stream processors are high-performance programmable processors optimized to run media applications. Recent work has shown these processors to be more area- and energy-efficient than conventional programmable architectures. This paper explores the scalability of stream architectures to future VLSI technologies where over a thousand floating-point units on a single chip will be feasible. Two techniques for increasing the number of ALUs in a stream processor are presented: intracluster and intercluster scaling. These scaling techniques are shown to be cost-efficient to tens of ALUs per cluster and to hundreds of arithmetic clusters. A 640-ALU stream processor with 128 clusters and 5 ALUs per cluster is shown to be feasible in 45 nanometer technology, sustaining over 300 GOPS on kernels and providing 15.3x of kernel speedup and 8.0x of application speedup over a 40-ALU stream processor with a 2% degradation in area per ALU and a 7% degradation in energy dissipated per ALU operation.
引用
收藏
页码:153 / 164
页数:12
相关论文
共 22 条
[1]  
Agarwal V, 2000, PROCEEDING OF THE 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, P248, DOI [10.1145/342001.339691, 10.1109/ISCA.2000.854395]
[2]  
[Anonymous], P 6 INT S HIGH PERF
[3]  
[Anonymous], THESIS U CALIFORNIA
[4]  
Asanovic K., 1998, Ph. D. Dissertation
[5]  
Caspi Eylon, 2001, P 3 WORKSH MED STREA, P21
[6]  
DALLY WJ, 1998, DIGITAL SYSTEMS ENG, P12
[7]  
HO R, 2001, P IEEE APR
[8]   A stereo machine for video-rate dense depth mapping and its new applications [J].
Kanade, T ;
Yoshida, A ;
Oda, K ;
Kano, H ;
Tanaka, M .
1996 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1996, :196-202
[9]  
Kapasi UJ, 2000, INT SYMP MICROARCH, P159, DOI 10.1109/MICRO.2000.898067
[10]   Imagine: Media processing with streams [J].
Khailany, B ;
Dally, WJ ;
Kapasi, UJ ;
Mattson, P ;
Namkoong, J ;
Owens, JD ;
Towles, B ;
Chang, A ;
Rixner, S .
IEEE MICRO, 2001, 21 (02) :35-46