A feedforward architecture accounts for rapid categorization

被引:611
作者
Serre, Thomas
Oliva, Aude
Poggio, Tomaso
机构
[1] MIT, Ctr Biol & Computat Learning, Cambridge, MA 02139 USA
[2] MIT, McGovern Inst Brain Res, Cambridge, MA 02139 USA
[3] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA
关键词
object recognition; computational model; visual cortex; natural scenes; preattentive vision;
D O I
10.1073/pnas.0700622104
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Primates are remarkably good at recognizing objects. The level of performance of their visual system and its robustness to image degradations still surpasses the best computer vision systems despite decades of engineering effort. In particular, the high accuracy of primates in ultra rapid object categorization and rapid serial visual presentation tasks is remarkable. Given the number of processing stages involved and typical neural latencies, such rapid visual processing is likely to be mostly feedforward. Here we show that a specific implementation of a class of feedforward theories of object recognition (that extend the Hubel and Wiesel simple-to-complex cell hierarchy and account for many anatomical and physiological constraints) can predict the level and the pattern of performance achieved by humans on a rapid masked animal vs. non-animal categorization task.
引用
收藏
页码:6424 / 6429
页数:6
相关论文
共 57 条