A comparative intelligibility study of single-microphone noise reduction algorithms

被引:177
作者
Hu, Yi [1 ]
Loizou, Philipos C. [1 ]
机构
[1] Univ Texas, Dept Elect Engn, Richardson, TX 75083 USA
关键词
D O I
10.1121/1.2766778
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The evaluation of intelligibility of noise reduction algorithms is reported. IEEE sentences and consonants were corrupted by four types of noise including babble, car, street and train at two siL,nal-to-noise ratio levels (0 and 5 dB), and then processed by eight speech enhancement methods encompassing four classes of algorithms: spectral subtractive, sub-space, statistical model based and Wiener-type algorithms. The enhanced speech was presented to normal-hearing listeners for identification. With the exception of a single noise condition, no algorithm produced significant improvements in speech intelligibility. Information transmission analysis of the consonant confusion matrices indicated that no algorithm improved significantly the place feature score, significantly, which is critically important for speech recognition. The algorithms which were found in previous studies to perform the best in terms of overall quality, were not the same algorithms that performed the best in terms of speech intelligibility. The subspace algorithm, for instance, Was previously found to perform the worst in terms of overall quality, but performed well in the present study in terms of preserving speech intelligibility. Overall, the analysis of consonant confusion matrices sucaests that in order for noise reduction algorithms to improve speech intelligibility, they need to improve the place and manner feature scores. (c) 2007 Acoustical Society of America.
引用
收藏
页码:1777 / 1786
页数:10
相关论文
共 24 条
[1]  
[Anonymous], 1969, IEEE T ACOUST SPEECH, VAU17, P225
[2]  
[Anonymous], 2002, P IEEE ICASSP
[3]   Evaluation of an auditory masked threshold noise suppression algorithm in normal-hearing and hearing-impaired listeners [J].
Arehart, KH ;
Hansen, JHL ;
Gallant, S ;
Kalstein, L .
SPEECH COMMUNICATION, 2003, 40 (04) :575-592
[4]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[5]  
BORDEN G, 1994, SPEECH SCI PRIMIER P
[6]   Noise estimation by minima controlled recursive averaging for robust speech enhancement [J].
Cohen, I ;
Berdugo, B .
IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (01) :12-15
[7]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :443-445
[8]   Spectral subtraction using reduced delay convolution and adaptive averaging [J].
Gustafsson, H ;
Nordholm, SE ;
Claesson, I .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08) :799-807
[9]  
Hirsch H., 2000, ISCA TUT RES WORKSH
[10]   Speech enhancement based on wavelet thresholding the multitaper spectrum [J].
Hu, Y ;
Loizou, PC .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (01) :59-67