Generalization to novel images in upright and inverted faces

Cited by: 74
Authors
Moses, Y
Ullman, S
Edelman, S
Affiliation
[1] Department of Applied Mathematics and Computer Science, Weizmann Institute of Science
DOI
10.1068/p250443
Chinese Library Classification number
R77 [Ophthalmology]
Discipline classification code
100212
Abstract
An image of a face depends not only on its shape, but also on the viewpoint, illumination conditions, and facial expression. A face recognition system must overcome the changes in face appearance induced by these factors. Two related questions were investigated: the capacity of the human visual system to generalize the recognition of faces to novel images, and the level at which this generalization occurs. This problem was approached by comparing the identification and generalization capacity for upright and inverted faces. For upright faces, remarkably good generalization to novel conditions was found. For inverted faces, the generalization to novel views was significantly worse for both new illumination and viewpoint, although the performance on the training images was similar to that on the upright condition. The results indicate that at least some of the processes that support generalization across viewpoint and illumination are neither universal (because subjects did not generalize as easily for inverted faces as for upright ones) nor strictly object specific (because in upright faces nearly perfect generalization was possible from a single view, by itself insufficient for building a complete object-specific model). It is proposed that generalization in face recognition occurs at an intermediate level that is applicable to a class of objects, and that at this level upright and inverted faces initially constitute distinct object classes.
Pages: 443-461
Page count: 19