METRIC INVARIANCE IN OBJECT RECOGNITION - A REVIEW AND FURTHER EVIDENCE

Cited by: 88
Authors
COOPER, EE [1]
BIEDERMAN, I [1]
HUMMEL, JE [1]
Affiliation
[1] UNIV MINNESOTA, MINNEAPOLIS, MN 55455 USA
Source
CANADIAN JOURNAL OF PSYCHOLOGY-REVUE CANADIENNE DE PSYCHOLOGIE | 1992 / Vol. 46 / No. 2
DOI
10.1037/h0084317
Chinese Library Classification number
B84 [Psychology];
Subject classification code
04 ; 0402 ;
Abstract
Phenomenologically, human shape recognition appears to be invariant with changes of orientation in depth (up to parts occlusion), position in the visual field, and size. Recent versions of template theories (e.g., Ullman, 1989; Lowe, 1987) assume that these invariances are achieved through the application of transformations such as rotation, translation, and scaling of the image so that it can be matched metrically to a stored template. Presumably, such transformations would require time for their execution. We describe recent priming experiments in which the effects of a prior brief presentation of an image on its subsequent recognition are assessed. The results of these experiments indicate that the invariance is complete: The magnitude of visual priming (as distinct from name or basic-level concept priming) is not affected by a change in position, size, orientation in depth, or the particular lines and vertices present in the image, as long as representations of the same components can be activated. An implemented seven-layer neural network model (Hummel & Biederman, 1992) that captures these fundamental properties of human object recognition is described. Given a line drawing of an object, the model activates a viewpoint-invariant structural description of the object, specifying its parts and their interrelations. Visual priming is interpreted as a change in the connection weights either (a) for the activation of cells, termed geon feature assemblies (GFAs), that conjoin the output of units that represent invariant, independent properties of a single geon and its relations (such as its type, aspect ratio, and relations to other geons), or (b) by which several GFAs activate a cell representing an object.
Pages: 191-214
Page count: 24
Related papers
40 entries in total
[1]  
Atkinson R.C, 1974, CONT DEV MATH PSYCHO, P242
[2]   ROLE OF VISUAL AND SEMANTIC CODES IN OBJECT NAMING [J].
BARTRAM, DJ .
COGNITIVE PSYCHOLOGY, 1974, 6 (03) :325-356
[3]   MENTAL SIZE SCALING EXAMINED [J].
BESNER, D ;
COLTHEART, M .
MEMORY & COGNITION, 1976, 4 (05) :525-531
[4]   PRIMING CONTOUR-DELETED IMAGES - EVIDENCE FOR INTERMEDIATE REPRESENTATIONS IN VISUAL OBJECT RECOGNITION [J].
BIEDERMAN, I ;
COOPER, EE .
COGNITIVE PSYCHOLOGY, 1991, 23 (03) :393-419
[5]   EVIDENCE FOR COMPLETE TRANSLATIONAL AND REFLECTIONAL INVARIANCE IN VISUAL OBJECT PRIMING [J].
BIEDERMAN, I ;
COOPER, EE .
PERCEPTION, 1991, 20 (05) :585-593
[6]   SIZE INVARIANCE IN VISUAL OBJECT PRIMING [J].
BIEDERMAN, I ;
COOPER, EE .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1992, 18 (01) :121-133
[7]   RECOGNITION-BY-COMPONENTS - A THEORY OF HUMAN IMAGE UNDERSTANDING [J].
BIEDERMAN, I .
PSYCHOLOGICAL REVIEW, 1987, 94 (02) :115-147
[8]   OBJECT RECOGNITION AND LATERALITY - NULL EFFECTS [J].
BIEDERMAN, I ;
COOPER, EE .
NEUROPSYCHOLOGIA, 1991, 29 (07) :685-694
[9]  
BRODY BA, 1978, BRAIN, V101, P307
[10]   SYMBOLIC REASONING AMONG 3-D MODELS AND 2-D IMAGES [J].
BROOKS, RA .
ARTIFICIAL INTELLIGENCE, 1981, 17 (1-3) :285-348