Automatic facial expression recognition using facial animation parameters and multistream HMMs

被引:133
作者
Aleksic, Petar S. [1 ]
Katsaggelos, Aggelos K. [1 ]
机构
[1] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL 60208 USA
关键词
facial expression recognition; multistream HMMs; facial animation parameters;
D O I
10.1109/TIFS.2005.863510
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The performance of an automatic facial expression recognition system can be significantly improved by modeling the reliability of different streams of facial expression information utilizing multistream hidden Markov models (HMMs). In this paper, we present an automatic multistream HMM facial expression recognition system and analyze its performance. The proposed system utilizes facial animation parameters (FAPs), supported by the MPEG-4 standard, as features for facial expression classification. Specifically, the FAPs describing the movement of the outer-lip contours and eyebrows are used as observations. Experiments are first performed employing single-stream HMMs under several different scenarios, utilizing outer-lip and eyebrow FAPs individually and jointly. A multistream HMM approach is proposed for introducing facial expression and FAP group dependent stream reliability weights. The stream weights are determined based on the facial expression recognition results obtained when FAP streams are utilized individually. The proposed multistream HMM facial expression system, which utilizes stream reliability weights, achieves relative reduction of the facial expression recognition error of 44% compared to the single-stream HMM system.
引用
收藏
页码:3 / 11
页数:9
相关论文
共 39 条
[1]  
Aleksic P., 2003, P WORKS MULT US AUTH, P80
[2]   Audio-visual speech recognition using MPEGA compliant visual features [J].
Aleksic, PS ;
Williams, JJ ;
Wu, ZL ;
Katsaggelos, AK .
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2002, 2002 (11) :1213-1227
[3]  
ALEKSIC PS, 2005, 6 E WORKSH IM AN MUL
[4]  
[Anonymous], ADV NEURAL INFORM PR
[5]  
BARTLETT M, 2003, P CVPR
[6]  
Bourlard H, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P426
[7]   Real-time lip tracking and bimodal continuous speech recognition [J].
Chan, MT ;
Zhang, Y ;
Huang, TS .
1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998, :65-70
[8]  
COHN J, 1997, 7 EUR C FAC EXP MEAS, P329
[9]   Classifying facial actions [J].
Donato, G ;
Bartlett, MS ;
Hager, JC ;
Ekman, P ;
Sejnowski, TJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (10) :974-989
[10]  
Ekman P., 1978, Facial action coding system: manual