The effect of LUT and cluster size on deep-submicron FPGA performance and density

被引:183
作者
Ahmed, E [1 ]
Rose, J [1 ]
机构
[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 3G4, Canada
关键词
architecture; clusters; computer-aided design (CAD); field-programmable gate-array (FPGA); look-up table (LUT); very large scale integration (VLSI);
D O I
10.1109/TVLSI.2004.824300
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we revisit the field-programmable gate-array (FPGA) architectural issue of the effect of logic block functionality on FPGA performance and density. In particular, in the context of lookup table, cluster-based island-style FPGAs (Betz et aL 1997) we look at the effect of lookup table (LUT) size and cluster size (number of LUTs per cluster) on the speed and logic density of an FPGA. We use a fully timing-driven experimental flow (Betz et al. 1997), (Marquardt, 1999) in which a set of benchmark circuits are synthesized into different cluster-based (Betz and Rose, 1997, 1998) and (Marquardt, 1999) logic block architectures, which contain groups of LUTs and flip-flops. Across all architectures with LUT sizes in the range of 2 to 7 inputs, and cluster size from 1 to 10 LUTs, we have experimentally determined the relationship between the number of inputs required for a cluster as a function of the LUT size (K) and cluster size (N). Second, contrary to previous results, we have shown that clustering small LUTs (sizes 2 and 3) produces better area results than what was presented in the past. However, our results also show that the performance of FPGAs with these small LUT sizes is significantly worse (by almost a factor of 2) than larger LUTs. Hence, as measured by area-delay product, or by performance, these would be a bad choice. Also, we have discovered that LUT sizes of 5 and 6 produce much better area results than were previously believed. Finally, our results show that a LUT size of 4 to 6 and cluster size of between 3-10 provides the best area-delay product for an FPGA.
引用
收藏
页码:288 / 298
页数:11
相关论文
共 30 条
[1]  
AGRAWAL O, 1999, P IEEE FIELD PROGR G, P17
[2]  
AHMED E, 2000, P ACM SIGDA INT S FI, P3
[3]  
[Anonymous], THESIS U TORONTO TOR
[4]   Cluster-based logic blocks for FPGAs: Area-efficiency vs. input sharing and size [J].
Betz, V ;
Rose, J .
PROCEEDINGS OF THE IEEE 1997 CUSTOM INTEGRATED CIRCUITS CONFERENCE, 1997, :551-554
[5]  
Betz V., 1999, Architecture and CAD for Deep-Submicron FPGAs
[6]  
BETZ V, 1998, IEEE DESIGN TEST SPR, P10
[7]   FPGA and CPLD architectures: A tutorial [J].
Brown, S ;
Rose, J .
IEEE DESIGN & TEST OF COMPUTERS, 1996, 13 (02) :42-57
[8]  
Brown S. D., 1992, FIELD PROGRAMMABLE G
[9]  
Chung K. C. K., 1994, THESIS U TORONTO TOR THESIS U TORONTO TOR
[10]   FLOWMAP - AN OPTIMAL TECHNOLOGY MAPPING ALGORITHM FOR DELAY OPTIMIZATION IN LOOKUP-TABLE BASED FPGA DESIGNS [J].
CONG, J ;
DING, YH .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1994, 13 (01) :1-12