Speech Emotion Recognition Based on Sparse Representation

Cited by: 25

Authors:

Yan, Jingjie [1]

Wang, Xiaolan [2]

Gu, Weiyi [2]

Ma, Lili [2]

Affiliations:

[1] Southeast Univ, Sch Informat Sci & Engn, Nanjing, Jiangsu, Peoples R China

[2] Southeast Univ, Res Ctr Learning Sci, Nanjing, Jiangsu, Peoples R China

Source:

ARCHIVES OF ACOUSTICS | 2013 / Vol. 38 / No. 04

Funding:

National Natural Science Foundation of China;

Keywords:

speech emotion recognition; sparse partial least squares regression (SPLSR); feature selection and dimensionality reduction; REGRESSION;

DOI:

10.2478/aoa-2013-0055

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

Speech emotion recognition is regarded as a meaningful yet difficult problem in a number of domains, including sentiment analysis, computer science, and pedagogy. In this study, we investigate in depth speech emotion recognition based on the sparse partial least squares regression (SPLSR) approach. We use SPLSR to perform feature selection and dimensionality reduction on the full set of acquired speech emotion features. With SPLSR, the components of redundant and uninformative speech emotion features are shrunk to zero, while the useful and informative features are retained and passed to the subsequent classification step. Experiments on the Berlin database show that the recognition rate of the SPLSR method reaches up to 79.23% and is superior to that of the other dimensionality reduction methods compared.
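The shrink-to-zero behaviour described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a simplified, single-component variant in which the leading PLS direction is soft-thresholded, and the function names (`soft_threshold`, `sparse_pls_direction`, `make_toy_data`), the `sparsity` parameter, and the synthetic data are all hypothetical stand-ins (no Berlin database features are used).

```python
import numpy as np


def soft_threshold(v, lam):
    """Shrink every entry of v toward zero by lam; entries with |v| <= lam become 0."""
    return np.sign(v) * np.maximum(np.abs(v) - lam, 0.0)


def sparse_pls_direction(X, Y, sparsity=0.5):
    """Return one unit-norm sparse weight vector for features X (n x p) and targets Y (n x q).

    `sparsity` in [0, 1) is a hypothetical tuning knob: the threshold is that
    fraction of the largest absolute loading.
    """
    Xc = X - X.mean(axis=0)          # center features
    Yc = Y - Y.mean(axis=0)          # center targets
    M = Xc.T @ Yc                    # p x q cross-product matrix
    U, _, _ = np.linalg.svd(M, full_matrices=False)
    u = U[:, 0]                      # dense PLS direction (leading left singular vector)
    w = soft_threshold(u, sparsity * np.max(np.abs(u)))
    return w / np.linalg.norm(w)     # the largest loading always survives, so the norm is > 0


def make_toy_data(n=200, p=20, informative=3, seed=0):
    """Synthetic two-class data: only the first `informative` columns depend on the label."""
    rng = np.random.default_rng(seed)
    labels = rng.integers(0, 2, size=n)
    X = rng.normal(size=(n, p))
    X[:, :informative] += 3.0 * labels[:, None]
    Y = np.eye(2)[labels]            # one-hot class indicators
    return X, Y, labels


X, Y, labels = make_toy_data()
w = sparse_pls_direction(X, Y)
selected = np.flatnonzero(w)         # indices of features kept by the sparse direction
scores = (X - X.mean(axis=0)) @ w    # one-dimensional projection for a downstream classifier
```

In this sketch, features whose loadings fall below the threshold receive an exactly zero weight, so `selected` lists the retained features and `scores` is the reduced representation that a classifier of one's choice could consume. A full SPLSR procedure would typically extract several components and tune the sparsity level by cross-validation.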


Pages: 465-470

Page count: 6

