A Survey of Affect Recognition Methods: Audio, Visual, and Spontaneous Expressions

被引:1658
作者
Zeng, Zhihong [1 ]
Pantic, Maja [2 ,3 ]
Roisman, Glenn I. [4 ]
Huang, Thomas S. [1 ]
机构
[1] Univ Illinois, Beckman Inst, Urbana, IL 61801 USA
[2] Univ London Imperial Coll Sci Technol & Med, Dept Comp, London SW7 2AZ, England
[3] Univ Twente, Fac Elect Engn Math & Comp Sci, Enschede, Netherlands
[4] Univ Illinois, Dept Psychol, Champaign, IL 61820 USA
基金
美国国家科学基金会; 欧洲研究理事会;
关键词
Evaluation/methodology; human-centered computing; affective computing; introductory; survey; FACIAL EXPRESSION; EMOTION RECOGNITION; SPEECH; DISCRIMINATION; SEQUENCES; LAUGHTER; FACES;
D O I
10.1109/TPAMI.2008.52
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated analysis of human affective behavior has attracted increasing attention from researchers in psychology, computer science, linguistics, neuroscience, and related disciplines. However, the existing methods typically handle only deliberately displayed and exaggerated expressions of prototypical emotions, despite the fact that deliberate behavior differs in visual appearance, audio profile, and timing from spontaneously occurring behavior. To address this problem, efforts to develop algorithms that can process naturally occurring human affective behavior have recently emerged. Moreover, an increasing number of efforts are reported toward multimodal fusion for human affect analysis, including audiovisual fusion, linguistic and paralinguistic fusion, and multicue visual fusion based on facial expressions, head movements, and body gestures. This paper introduces and surveys these recent advances. We first discuss human emotion perception from a psychological perspective. Next, we examine available approaches for solving the problem of machine understanding of human affective behavior and discuss important issues like the collection and availability of training and test data. We finally outline some of the scientific and engineering challenges to advancing human affect sensing technology.
引用
收藏
页码:39 / 58
页数:20
相关论文
共 160 条
[1]   THIN SLICES OF EXPRESSIVE BEHAVIOR AS PREDICTORS OF INTERPERSONAL CONSEQUENCES - A METAANALYSIS [J].
AMBADY, N ;
ROSENTHAL, R .
PSYCHOLOGICAL BULLETIN, 1992, 111 (02) :256-274
[2]   Weakly Krull and related domains of the form D+M, A+XB[X] and A+X2B[X] [J].
Anderson, David F. ;
Chang, Gyu Whan ;
Park, Jeanam .
ROCKY MOUNTAIN JOURNAL OF MATHEMATICS, 2006, 36 (01) :1-22
[3]  
Ang J., 2002, P 8 INT C SPOK LANG
[4]  
[Anonymous], 2004, COMBINING PATTERN CL, DOI DOI 10.1002/0471660264
[5]  
[Anonymous], INT J WAVELETS MULTI
[6]  
[Anonymous], P 8 EUR C SPEECH COM
[7]  
[Anonymous], HUMAN FACE
[8]  
[Anonymous], P 9 INT C SPOK LANG
[9]  
[Anonymous], 2005, INTERSPEECH 2005
[10]  
[Anonymous], P IEEE INT C COMP VI