How Does the Brain Solve Visual Object Recognition?

被引:1071
作者
DiCarlo, James J. [1 ,2 ]
Zoccolan, Davide [3 ,5 ]
Rust, Nicole C. [4 ]
机构
[1] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA
[2] MIT, McGovern Inst Brain Res, Cambridge, MA 02139 USA
[3] Int Sch Adv Studies SISSA, Cognit Neurosci Sect, I-34136 Trieste, Italy
[4] Univ Penn, Dept Psychol, Philadelphia, PA 19104 USA
[5] Int Sch Adv Studies SISSA, Neurobiol Sect, I-34136 Trieste, Italy
基金
美国国家科学基金会;
关键词
INFERIOR TEMPORAL CORTEX; RECEPTIVE-FIELD PROPERTIES; INFEROTEMPORAL CORTEX; SHAPE SELECTIVITY; INDIVIDUAL NEURONS; FAMILIAR OBJECTS; SINGLE NEURONS; NEURAL CIRCUIT; CORTICAL AREAS; NATURAL IMAGES;
D O I
10.1016/j.neuron.2012.01.010
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Mounting evidence suggests that 'core object recognition,' the ability to rapidly recognize objects despite substantial appearance variation, is solved in the brain via a cascade of reflexive, largely feedforward computations that culminate in a powerful neuronal representation in the inferior temporal cortex. However, the algorithm that produces this solution remains poorly understood. Here we review evidence ranging from individual neurons and neuronal populations to behavior and computational models. We propose that understanding this algorithm will require using neuronal and psychophysical data to sift through many computational models, each based on building blocks of small, canonical subnetworks with a common functional goal.
引用
收藏
页码:415 / 434
页数:20
相关论文
共 191 条
[1]   Representational capacity of face coding in monkeys [J].
Abbott, LF ;
Rolls, ET ;
Tovee, MJ .
CEREBRAL CORTEX, 1996, 6 (03) :498-505
[2]   SPATIOTEMPORAL ENERGY MODELS FOR THE PERCEPTION OF MOTION [J].
ADELSON, EH ;
BERGEN, JR .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1985, 2 (02) :284-299
[3]   Retinotopy of the face aftereffect [J].
Afraz, Seyed-Reza ;
Cavanagh, Patrick .
VISION RESEARCH, 2008, 48 (01) :42-54
[4]   Microstimulation of inferotemporal cortex influences face categorization [J].
Afraz, Seyed-Reza ;
Kiani, Roozbeh ;
Esteky, Hossein .
NATURE, 2006, 442 (7103) :692-695
[5]   Scene perception: inferior temporal cortex neurons encode the positions of different objects in the scene [J].
Aggelopoulos, NC ;
Rolls, ET .
EUROPEAN JOURNAL OF NEUROSCIENCE, 2005, 22 (11) :2903-2916
[6]  
[Anonymous], 1947, The neocortex of Macaca mulatta
[7]  
[Anonymous], 2009, P INT C COMP VIS ICC
[8]  
[Anonymous], 1990, VISUAL AGNOSIA DISOR
[9]  
[Anonymous], 1982, Visual perception
[10]   Top-down facilitation of visual recognition [J].
Bar, M ;
Kassam, KS ;
Ghuman, AS ;
Boshyan, J ;
Schmidt, AM ;
Dale, AM ;
Hämäläinen, MS ;
Marinkovic, K ;
Schacter, DL ;
Rosen, BR ;
Halgren, E .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (02) :449-454