Classification of faces in man and machine

被引:33
作者
Graf, ABA [1 ]
Wichmann, FA [1 ]
Bülthoff, HH [1 ]
Schölkopf, B [1 ]
机构
[1] Max Planck Inst Biol Cybernet, D-72076 Tubingen, Germany
关键词
D O I
10.1162/089976606774841611
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
We attempt to shed light on the algorithms humans use to classify images of human faces according to their gender. For this, a novel methodology combining human psychophysics and machine learning is introduced. We proceed as follows. First, we apply principal component analysis (PCA) on the pixel information of the face stimuli. We then obtain a data set composed of these PCA eigenvectors combined with the subjects' gender estimates of the corresponding stimuli. Second, we model the gender classification process on this data set using a separating hyperplane (SH) between both classes. This SH is computed using algorithms from machine learning: the support vector machine (SVM), the relevance vector machine, the prototype classifier, and the K-means classifier. The classification behavior of humans and machines is then analyzed in three steps. First, the classification errors of humans and machines are compared for the various classifiers, and we also assess how well machines can recreate the subjects' internal decision boundary by studying the training errors of the machines. Second, we study the correlations between the rank-order of the subjects' responses to each stimulus-the gender estimate with its reaction time and confidence rating-and the rank-order of the distance of these stimuli to the SH. Finally, we attempt to compare the metric of the representations used by humans and machines for classification by relating the subjects' gender estimate of each stimulus and the distance of this stimulus to the SH. While we show that the classification error alone is not a sufficient selection criterion between the different algorithms humans might use to classify face stimuli, the distance of these stimuli to the SH is shown to capture essentials of the internal decision space of humans. Furthermore, algorithms such as the prototype classifier using stimuli in the center of the classes are shown to be less adapted to model human classification behavior than algorithms such as the SVM based on stimuli close to the boundary between the classes.
引用
收藏
页码:143 / 165
页数:23
相关论文
共 54 条
[1]
[Anonymous], THESIS M PLANCK I BI
[2]
[Anonymous], 2002, Elementary Signal Detection Theory
[3]
The neurobiology of human category learning [J].
Ashby, FG ;
Ell, SW .
TRENDS IN COGNITIVE SCIENCES, 2001, 5 (05) :204-210
[4]
CATEGORIZATION RESPONSE-TIME WITH MULTIDIMENSIONAL STIMULI [J].
ASHBY, FG ;
BOYNTON, G ;
LEE, WW .
PERCEPTION & PSYCHOPHYSICS, 1994, 55 (01) :11-27
[5]
Face recognition by independent component analysis [J].
Bartlett, MS ;
Movellan, JR ;
Sejnowski, TJ .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (06) :1450-1464
[6]
A morphable model for the synthesis of 3D faces [J].
Blanz, V ;
Vetter, T .
SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, :187-194
[7]
A principal component analysis of facial expressions [J].
Calder, AJ ;
Burton, AM ;
Miller, P ;
Young, AW ;
Akamatsu, S .
VISION RESEARCH, 2001, 41 (09) :1179-1208
[8]
Churchland Patricia S., 1992, The Computational Brain, DOI DOI 10.7551/MITPRESS/2010.001.0001
[9]
COTTRELL G, 1991, ADV NEURAL INFORMATI, V3, P654
[10]
EMPATH: A neural network that categorizes facial expressions [J].
Dailey, MN ;
Cottrell, GW ;
Padgett, C ;
Adolphs, R .
JOURNAL OF COGNITIVE NEUROSCIENCE, 2002, 14 (08) :1158-1173