Tracking of nonstationary noise based on data-driven recursive noise power estimation

Cited by: 58
Authors
Erkelens, Jan S. [1]
Heusdens, Richard [1]
Affiliations
[1] Delft Univ Technol, Dept Mediamat, NL-2628 CD Delft, Netherlands
Source
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2008 / Vol. 16 / No. 06
Keywords
discrete Fourier transform (DFT)-based speech enhancement; minimum mean-square error (MMSE) estimation; noise spectrum estimation; noise tracking
DOI
10.1109/TASL.2008.2001108
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper considers estimation of the noise spectral variance from speech signals contaminated by highly nonstationary noise sources. The method can accurately track fast changes in noise power level (up to about 10 dB/s). In each time frame, for each frequency bin, the noise variance estimate is updated recursively with the minimum mean-square error (MMSE) estimate of the current noise power. A time- and frequency-dependent smoothing parameter is used, which is varied according to an estimate of speech presence probability. In this way, the amount of speech power leaking into the noise estimates is kept low. For the estimation of the noise power, a spectral gain function is used, which is found by an iterative data-driven training method. The proposed noise tracking method is tested on various stationary and nonstationary noise sources, for a wide range of signal-to-noise ratios, and compared with two state-of-the-art methods. When used in a speech enhancement system, improvements in segmental signal-to-noise ratio of more than 1 dB can be obtained for the most nonstationary noise sources at high noise levels.
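The recursive update described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the trained gain function, the speech presence probability estimator, or the rule that maps that probability to the smoothing parameter. Here the per-bin noise power estimate and speech presence probability are simply taken as inputs, and the linear mapping `alpha = alpha_min + (1 - alpha_min) * p` with `alpha_min = 0.8` is an assumption borrowed from the common recursive-averaging form. The function name `update_noise_variance` and all parameter names are hypothetical.

```python
def update_noise_variance(prev_var, noise_power_est, speech_prob, alpha_min=0.8):
    """Recursively update the noise variance of every frequency bin for one frame.

    prev_var        -- noise variance estimates from the previous frame
    noise_power_est -- estimates of the current noise power (assumed given;
                       the paper obtains them with a trained gain function)
    speech_prob     -- speech presence probabilities in [0, 1] (assumed given)
    alpha_min       -- assumed smoothing value used when speech is absent
    """
    updated = []
    for var, est, p in zip(prev_var, noise_power_est, speech_prob):
        # Assumed mapping: the more likely speech is present, the closer the
        # smoothing parameter gets to 1, so the old estimate is kept and
        # little speech power leaks into the noise estimate.
        alpha = alpha_min + (1.0 - alpha_min) * p
        updated.append(alpha * var + (1.0 - alpha) * est)
    return updated


# Two bins with the same inputs but opposite speech presence probabilities.
previous = [1.0, 1.0]
current_estimate = [2.0, 2.0]
probability = [0.0, 1.0]
result = update_noise_variance(previous, current_estimate, probability)
print(result)
```

In this toy call the first bin (speech judged absent) moves toward the new estimate, while the second bin (speech judged present) keeps its previous value, which is the behavior the abstract attributes to the probability-dependent smoothing parameter.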
Pages: 1112-1123
Page count: 12