A feedforward architecture accounts for rapid categorization

被引：611

作者：

Serre, Thomas

Oliva, Aude

Poggio, Tomaso

机构：

[1] MIT, Ctr Biol & Computat Learning, Cambridge, MA 02139 USA

[2] MIT, McGovern Inst Brain Res, Cambridge, MA 02139 USA

[3] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA

来源：

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA | 2007年 / 104卷 / 15期

关键词：

object recognition; computational model; visual cortex; natural scenes; preattentive vision;

D O I：

10.1073/pnas.0700622104

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Primates are remarkably good at recognizing objects. The level of performance of their visual system and its robustness to image degradations still surpasses the best computer vision systems despite decades of engineering effort. In particular, the high accuracy of primates in ultra rapid object categorization and rapid serial visual presentation tasks is remarkable. Given the number of processing stages involved and typical neural latencies, such rapid visual processing is likely to be mostly feedforward. Here we show that a specific implementation of a class of feedforward theories of object recognition (that extend the Hubel and Wiesel simple-to-complex cell hierarchy and account for many anatomical and physiological constraints) can predict the level and the pattern of performance achieved by humans on a rapid masked animal vs. non-animal categorization task.

引用

页码：6424 / 6429

页数：6

共 57 条

[1] An integrated network for invariant visual detection and recognition
Amit, Y
Mascaro, M
[J]. VISION RESEARCH, 2003, 43 (19) : 2073 - 2088
[2] The time course of visual processing:: Backward masking and natural scene categorisation
Bacon-Macé, N
Macé, MJM
Fabre-Thorpe, M
Thorpe, SJ
[J]. VISION RESEARCH, 2005, 45 (11) : 1459 - 1469
[3] RECOGNITION-BY-COMPONENTS - A THEORY OF HUMAN IMAGE UNDERSTANDING
BIEDERMAN, I
[J]. PSYCHOLOGICAL REVIEW, 1987, 94 (02) : 115 - 147
[4] Bienenstock E, 1997, ADV NEUR IN, V9, P838
[5] The psychophysics toolbox
Brainard, DH
[J]. SPATIAL VISION, 1997, 10 (04): : 433 - 436
[6] BREITMEYER B, 2006, VISUAL MAKING TIME S
[7] FACE-SELECTIVE CELLS IN THE TEMPORAL CORTEX OF MONKEYS
DESIMONE, R
[J]. JOURNAL OF COGNITIVE NEUROSCIENCE, 1991, 3 (01) : 1 - 8
[8] SPATIAL-FREQUENCY SELECTIVITY OF CELLS IN MACAQUE VISUAL-CORTEX
DEVALOIS, RL
ALBRECHT, DG
THORELL, LG
[J]. VISION RESEARCH, 1982, 22 (05) : 545 - 559
[9] Devroye L., 1996, A probabilistic theory of pattern recognition
[10] What's new in visual masking?
Enns, JT
Di Lollo, V
[J]. TRENDS IN COGNITIVE SCIENCES, 2000, 4 (09) : 345 - 352

← 1 2 3 4 5 6 →