ACE16k:: The third generation of mixed-signal SIMD-CNN ACE chips toward VSoCs

被引:161
作者
Rodríguez-Vázquez, A [1 ]
Liñán-Cembrano, G [1 ]
Carranza, L [1 ]
Roca-Moreno, E [1 ]
Carmona-Galán, R [1 ]
Jiménez-Garrido, F [1 ]
Domínguez-Castro, R [1 ]
Meana, SE [1 ]
机构
[1] Univ Seville, CNM, IMSE, E-41012 Seville, Spain
关键词
analog programmable very large-scale integration; (VLSI); early vision chips; silicon retinas;
D O I
10.1109/TCSI.2004.827621
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Today, with 0.18-mum technologies mature and stable enough for mixed-signal design with a large variety of CMOS compatible optical sensors available and with 0.09-mum technologies knocking at the door of designers, we can face the design of integrated systems, instead of just integrated circuits. In fact, significant progress has been made in the last few years toward the realization of vision systems on chips (VSoCs). Such VSoCs are eventually targeted to integrate within a semiconductor substrate the functions of optical sensing, image processing in space and time, high-level processing, and the control of actuators. The consecutive generations of ACE chips define a roadmap toward flexible VSoCs. These chips consist of arrays of mixed-signal processing elements (PEs) which operate in accordance with single instruction multiple data (SIMD) computing architectures and exhibit the functional features of CNN Universal Machines. They have been conceived to cover the early stages of the visual processing path in a fully parallel manner, and hence more efficiently than DSP-based systems. Across the different generations, different improvements and modifications have been made looking to converge with the newest discoveries of neurobiologists regarding the behavior of natural retinas. This paper presents considerations pertaining to the design of a member of the third generation of ACE chips, namely to the so-called ACE16k chip. This chip, designed in a 0.35-mum standard CMOS technology, contains about 3.75 million transistors and exhibits peak computing figures of 330 GOPS, 3.6 GOPS/mm(2) and 82.5 GOPS/W. Each PE in the array contains a reconfigurable computing kernel capable of calculating linear convolutions on 3 x 3 neighborhoods in less than 1.5 mus, imagewise Boolean combinations in less than 200 ns, imagewise arithmetic operations in about 5 mus, and CNN-like temporal evolutions with a time constant of about 0.5 mus. Unfortunately, the many ideas underlying the design of this chip cannot be covered in a single paper; hence, this paper is focused on, first, placing the ACE16k in the ACE chip roadmap and, then, discussing the most significant modifications of ACE16K versus its predecessors in the family.
引用
收藏
页码:851 / 863
页数:13
相关论文
共 19 条
[1]  
[Anonymous], IEEE T NEURAL NETWOR
[2]  
[Anonymous], 2002, INT TECHNOLOGY ROADM
[3]  
[Anonymous], 1999, HDB COMPUTER VISION
[4]   A 0.8-mu m CMOS two-dimensional programmable mixed-signal focal-plane array processor with on-chip binary imaging and instructions storage [J].
DominguezCastro, R ;
Espejo, S ;
RodriguezVazquez, A ;
Carmona, RA ;
Foldesy, P ;
Zarandy, A ;
Szolgay, P ;
Sziranyi, T ;
Roska, T .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1997, 32 (07) :1013-1026
[5]  
DUDEK P, 2000, THESIS U MANCHESTER
[6]  
Hastings A., 2001, ART ANALOG LAYOUT
[7]  
HUGHES JB, 1993, 1993 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS : PROCEEDINGS, VOLS 1-4 ( ISCAS 93 ), P1235
[8]   ACE4k:: An analog I/O 64 x 64 visual microprocessor chip with 7-bit analog accuracy [J].
Liñán, G ;
Espejo, S ;
Domínguez-Castro, R ;
Rodríguez-Vázquez, A .
INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND APPLICATIONS, 2002, 30 (2-3) :89-116
[9]  
LINAN G, 2002, THESIS U SEVILLE SEV
[10]  
LINAN G, 2003, P SPIE C EL IM SANT