Fast and accurate sequential floating forward feature selection with the Bayes classifier applied to speech emotion recognition

被引:114
作者
Ververidis, Dimitrios [1 ]
Kotropoulos, Constantine [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, Artificial Intelligence & Informat Anal Lab, Thessaloniki 54124, Greece
关键词
Bayes classifier; cross-validation; variance of the correct classification rate of the Bayes classifier; feature selection; wrappers;
D O I
10.1016/j.sigpro.2008.07.001
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper addresses subset feature selection performed by the sequential floating forward selection (SFFS). The criterion employed in SFFS is the correct classification rate of the Bayes classifier assuming that the features obey the multivariate Gaussian distribution. A theoretical analysis that models the number of correctly classified utterances as a hypergeometric random variable enables the derivation of an accurate estimate of the variance of the correct classification rate during cross-validation. By employing such variance estimate, we propose a fast SFFS variant. Experimental findings on Danish emotional speech (DES) and speech under simulated and actual stress (SUSAS) databases demonstrate that SFFS computational time is reduced by 50% and the correct classification rate for classifying speech into emotional states for the selected subset of features varies less than the correct classification rate found by the standard SFFS. Although the proposed SIFFS variant is tested in the framework of speech emotion recognition, the theoretical results are valid for any classifier in the context of any wrapper algorithm. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:2956 / 2970
页数:15
相关论文
共 21 条
[1]  
[Anonymous], P EUR SIGN PROC C EU
[2]  
[Anonymous], INT J HUMAN COMPUTER, DOI DOI 10.1016/S1071-581(02)00141-6
[4]   Emotion recognition in human-computer interaction [J].
Cowie, R ;
Douglas-Cowie, E ;
Tsapatsoulis, N ;
Votsis, G ;
Kollias, S ;
Fellenz, W ;
Taylor, JG .
IEEE SIGNAL PROCESSING MAGAZINE, 2001, 18 (01) :32-80
[5]   Approximate statistical tests for comparing supervised classification learning algorithms [J].
Dietterich, TG .
NEURAL COMPUTATION, 1998, 10 (07) :1895-1923
[6]  
Engberg I.S., 1996, Documentation of the Danish Emotional Speech Database DES
[7]  
Evans M., 2000, STAT DISTRIBUTIONS
[8]  
FERRI FJ, 1994, MACH INTELL PATT REC, V16, P403
[9]  
Fukunaga K., 1990, INTRO STAT PATTERN R
[10]   Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition [J].
Hansen, JHL .
SPEECH COMMUNICATION, 1996, 20 (1-2) :151-173