SPEECH RECOGNITION IN NOISY ENVIRONMENTS - A SURVEY

Cited by: 341
Author
GONG, YF [1]
Affiliation
[1] BROADCAST TECHNOL RES BRANCH, COMMUN RES CTR, DEPT COMMUN, OTTAWA, ON, CANADA
Keywords
SURVEY; NOISY SPEECH RECOGNITION; PARAMETRIZATION; SPEECH ENHANCEMENT; COMPENSATION FOR NOISE
DOI
10.1016/0167-6393(94)00059-J
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The performance of most current speech recognizers degrades significantly when environmental noise occurs during use. This degradation is caused mainly by mismatches between the training and operating environments. In recent years much effort has been directed at reducing this mismatch. This paper surveys research results on digital techniques for single-microphone noisy speech recognition, classified into three categories: noise-resistant features and similarity measurement, speech enhancement, and speech model compensation for noise. The survey indicates that the essential points in noisy speech recognition are incorporating time and frequency correlations, giving more importance to high-SNR portions of speech in decision making, exploiting task-specific a priori knowledge of both speech and noise, using class-dependent processing, and including auditory models in speech processing.
Pages: 261-291
Number of pages: 31