Dissociating stimulus information from internal representation - a case study in object recognition

被引:13
作者
Liu, ZL
Kersten, D
Knill, DC
机构
[1] NEC Res Inst, Princeton, NJ 08540 USA
[2] Univ Minnesota, Dept Psychol, Minneapolis, MN 55455 USA
[3] Univ Penn, Dept Psychol, Philadelphia, PA 19104 USA
关键词
object recognition; representation; ideal observer; image information; object complexity;
D O I
10.1016/S0042-6989(98)00167-9
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Human object recognition is a function of both internal memory representation(s) and stimulus input information. The role of the latter has been so far largely overlooked, and the nature of the representation is often directly equated with recognition performance. We quantify stimulus information for three classes of objects in order of decreasing object complexity: unconnected balls, balls connected with lines, and balls connected with cylinders. In an object discrimination task, subjects' performance improved with the decreasing object complexity. We show that input information also increases with decreasing object complexity. Therefore, the results could potentially be accounted for either by differences in the object representations learned for each class of objects, or by the increased information about the three-dimensional (3D) structure inherent in images of the less complex objects, or by both. We demonstrate that, when image information is taken into account, by computing efficiencies relative to a set of ideal observers, subjects were more efficient in recognizing the less complex objects. This suggests that differences in subjects' performance for different object classes is at least partly a function of the internal representations learned for the different object classes. We stress that this conclusion cannot be achieved without the quantitative analysis of stimulus input information. (C) 1998 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:603 / 612
页数:10
相关论文
共 12 条
[1]   VIEWPOINT-DEPENDENT MECHANISMS IN VISUAL OBJECT RECOGNITION - REPLY TO TARR AND BULTHOFF (1995) [J].
BIEDERMAN, I ;
GERHARDSTEIN, PC .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1995, 21 (06) :1506-1514
[2]  
FARAH MJ, 1994, COGNITIVE SCI, V18, P325, DOI 10.1207/s15516709cog1802_4
[3]  
Hays W. L., 1988, STATISTICS
[4]   VISUAL-PERCEPTION OF BIOLOGICAL MOTION AND A MODEL FOR ITS ANALYSIS [J].
JOHANSSON, G .
PERCEPTION & PSYCHOPHYSICS, 1973, 14 (02) :201-211
[5]   OBJECT CLASSIFICATION FOR HUMAN AND IDEAL OBSERVERS [J].
LIU, ZL ;
KNILL, DC ;
KERSTEN, D .
VISION RESEARCH, 1995, 35 (04) :549-568
[6]  
Liu ZL, 1996, SPATIAL VISION, V9, P491, DOI 10.1163/156856896X00222
[7]   IS HUMAN OBJECT RECOGNITION BETTER DESCRIBED BY GEON STRUCTURAL DESCRIPTIONS OR BY MULTIPLE VIEWS - COMMENT ON BIEDERMAN AND GERHARDSTEIN (1993) [J].
TARR, MJ ;
BULTHOFF, HH .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1995, 21 (06) :1494-1505
[8]   The viewpoint complexity of an object-recognition task [J].
Tjan, BS ;
Legge, GE .
VISION RESEARCH, 1998, 38 (15-16) :2335-2350
[9]   HUMAN-EFFICIENCY FOR RECOGNIZING 3-D OBJECTS IN LUMINANCE NOISE [J].
TJAN, BS ;
BRAJE, WL ;
LEGGE, GE ;
KERSTEN, D .
VISION RESEARCH, 1995, 35 (21) :3053-3069
[10]   INTERPRETATION OF STRUCTURE FROM MOTION [J].
ULLMAN, S .
PROCEEDINGS OF THE ROYAL SOCIETY SERIES B-BIOLOGICAL SCIENCES, 1979, 203 (1153) :405-426