ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks using Angle Sensitive Pixels

被引:44
作者
Chen, Huaijin G. [1 ]
Jayasuriya, Suren [2 ]
Yang, Jiyue [2 ]
Stephen, Judy [2 ]
Sivaramakrishnan, Sriram [2 ]
Veeraraghavan, Ashok [1 ]
Molnar, Alyosha [2 ]
机构
[1] Rice Univ, Houston, TX 77251 USA
[2] Cornell Univ, Ithaca, NY 14853 USA
来源
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016年
基金
美国国家科学基金会;
关键词
IMPLEMENTATION; FIELD;
D O I
10.1109/CVPR.2016.104
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning using convolutional neural networks (CNNs) is quickly becoming the state-of-the-art for challenging computer vision applications. However, deep learning's power consumption and bandwidth requirements currently limit its application in embedded and mobile systems with tight energy budgets. In this paper, we explore the energy savings of optically computing the first layer of CNNs. To do so, we utilize bio-inspired Angle Sensitive Pixels (ASPs), custom CMOS diffractive image sensors which act similar to Gabor filter banks in the V1 layer of the human visual cortex. ASPs replace both image sensing and the first layer of a conventional CNN by directly performing optical edge filtering, saving sensing energy, data bandwidth, and CNN FLOPS to compute. Our experimental results (both on synthetic data and a hardware prototype) for a variety of vision tasks such as digit recognition, object recognition, and face identification demonstrate up to 90% reduction in image sensor power consumption and 90% reduction in data bandwidth from sensor to CPU, while achieving similar performance compared to traditional deep learning pipelines.
引用
收藏
页码:903 / 912
页数:10
相关论文
共 55 条
[21]  
[Anonymous], 2014, COMPRESSING DEEP CON
[22]  
[Anonymous], T ELECT DEVICES
[23]  
[Anonymous], 2006, IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
[24]   Evaluating Open-Universe Face Identification on the Web [J].
Becker, Brian C. ;
Ortiz, Enrique G. .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, :904-911
[25]   AN ANALOG NEURAL NETWORK PROCESSOR WITH PROGRAMMABLE TOPOLOGY [J].
BOSER, BE ;
SACKINGER, E ;
BROMLEY, J ;
LECUN, Y ;
JACKEL, LD .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1991, 26 (12) :2017-2025
[26]  
Dosovitskiy A, 2015, PROC CVPR IEEE, P1538, DOI 10.1109/CVPR.2015.7298761
[27]  
Farabet Clement, 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, P878, DOI 10.1109/ICCVW.2009.5457611
[28]   OPTICAL IMPLEMENTATION OF THE HOPFIELD MODEL [J].
FARHAT, NH ;
PSALTIS, D ;
PRATA, A ;
PAEK, E .
APPLIED OPTICS, 1985, 24 (10) :1469-1475
[29]   Implementation of steerable spatiotemporal image filters on the focal plane [J].
Gruev, V ;
Etienne-Cummings, R .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-ANALOG AND DIGITAL SIGNAL PROCESSING, 2002, 49 (04) :233-244
[30]   CCD polarization imaging sensor with aluminum nanowire optical filters [J].
Gruev, Viktor ;
Perkins, Rob ;
York, Timothy .
OPTICS EXPRESS, 2010, 18 (18) :19087-19094