Improved MFCC-Based Feature for Robust Speaker Identification

被引:11
作者
吴尊敬
曹志刚
机构
[1] State Key Laboratory on Microwave and Digital Communications
[2] Department of Electronic Engineering
[3] Tsinghua University
[4] Beijing
[5] China
关键词
Mel-frequency cepstral coefficient (MFCC); robust speaker identification; feature extraction;
D O I
暂无
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
The Mel-frequency cepstral coefficient (MFCC) is the most widely used feature in speech and speaker recognition. However, MFCC is very sensitive to noise interference, which tends to drastically de- grade the performance of recognition systems because of the mismatches between training and testing. In this paper, the logarithmic transformation in the standard MFCC analysis is replaced by a combined function to improve the noisy sensitivity. The proposed feature extraction process is also combined with speech en- hancement methods, such as spectral subtraction and median-filter to further suppress the noise. Experi- ments show that the proposed robust MFCC-based feature significantly reduces the recognition error rate over a wide signal-to-noise ratio range.
引用
收藏
页码:158 / 161
页数:4
相关论文
empty
未找到相关数据