Noise compensation methods for hidden Markov model speech recognition in adverse environments

被引:55
作者
Vaseghi, SV
Milner, BP
机构
[1] Department of Electrical and Electronic Engineering, Queen's University of Belfast, Belfast
[2] Speech Technology Unit, BT Laboratories, Martlesham Heath, Suffolk
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1997年 / 5卷 / 01期
关键词
D O I
10.1109/89.554264
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Several noise compensation schemes for speech recognition in impulsive and nonimpulsive noise are considered. The noise compensation schemes are spectral subtraction, HMM-based Wiener filters, noise-adaptive HMM's, and a front-end impulsive noise removal. The use of the cepstral-time matrix as an improved speech feature set is explored, and the noise compensation methods are extended for use with cepstral-time features. Experimental evaluations, on a spoken digit database, in the presence of car noise, helicopter noise, and impulsive noise, demonstrate that the noise compensation methods achieve substantial improvement in recognition across a wide range of signal-to-noise ratios. The results also show that the Cepstral-time matrix is more robust than a vector of identical size, which is composed of a combination of cepstral and differential cepstral features.
引用
收藏
页码:11 / 21
页数:11
相关论文
共 30 条
[1]  
[Anonymous], 1996, Advanced Signal Processing and Digital Noise Reduction
[2]   SPOKEN-WORD RECOGNITION USING DYNAMIC FEATURES ANALYZED BY TWO-DIMENSIONAL CEPSTRUM [J].
ARIKI, Y ;
MIZUTA, S ;
NAGATA, M ;
SAKAI, T .
IEE PROCEEDINGS-I COMMUNICATIONS SPEECH AND VISION, 1989, 136 (02) :133-140
[3]  
BEATTIE VL, 1991, P ICASSP, P917
[4]  
BERSTEIN AD, 1991, P ICASSP, P913
[5]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[6]  
BRIDLE JS, 1984, P I ACOUST 4, V6, P307
[7]  
Deller Jr J. R., 1993, DISCRETE TIME PROCES
[8]   ON THE APPLICATION OF HIDDEN MARKOV-MODELS FOR ENHANCING NOISY SPEECH [J].
EPHRAIM, Y ;
MALAH, D ;
JUANG, BH .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (12) :1846-1856
[9]  
FLORES JAN, 1993, P EUR, P829
[10]   SPEAKER-INDEPENDENT ISOLATED WORD RECOGNITION USING DYNAMIC FEATURES OF SPEECH SPECTRUM [J].
FURUI, S .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (01) :52-59