Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay

Cited by: 411
Authors
Gerkmann, Timo [1]
Hendriks, Richard C. [2]
Affiliations
[1] Carl von Ossietzky Univ Oldenburg, Speech Signal Proc Grp, D-26111 Oldenburg, Germany
[2] Delft Univ Technol, Signal & Informat Proc Lab, NL-2628 CD Delft, Netherlands
Source
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2012 / Vol. 20 / No. 04
Keywords
Noise power estimation; speech enhancement; square error estimation; estimation algorithm; SNR; gamma
DOI
10.1109/TASL.2011.2180896
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Recently, it has been proposed to estimate the noise power spectral density by means of minimum mean-square error (MMSE) optimal estimation. We show that the resulting estimator can be interpreted as a voice activity detector (VAD)-based noise power estimator, where the noise power is updated only when speech absence is signaled and a bias compensation is then required. We show that the bias compensation is unnecessary when we replace the VAD by a soft speech presence probability (SPP) with fixed priors. Choosing fixed priors also has the benefit of decoupling the noise power estimator from subsequent steps in a speech enhancement framework, such as the estimation of the speech power and the estimation of the clean speech. We show that the proposed SPP approach maintains the quick noise tracking performance of the bias-compensated MMSE-based approach while exhibiting less overestimation of the spectral noise power and an even lower computational complexity.
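The soft-decision update described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation. It assumes complex Gaussian models for the speech and noise spectral coefficients. The function name `spp_noise_update`, the prior `prior_h1 = 0.5`, the fixed a priori SNR `xi_h1_db = 15`, and the smoothing constant `alpha = 0.8` are illustrative assumptions that do not appear in the abstract.

```python
import math


def spp_noise_update(periodogram, noise_psd_prev,
                     prior_h1=0.5, xi_h1_db=15.0, alpha=0.8):
    """One frame of a soft-decision noise PSD update (hypothetical helper).

    periodogram    -- |Y|^2 per frequency bin for the current frame
    noise_psd_prev -- noise PSD estimate per bin from the previous frame
    prior_h1       -- assumed fixed prior probability of speech presence
    xi_h1_db       -- assumed fixed a priori SNR under speech presence, in dB
    alpha          -- assumed recursive smoothing constant
    """
    xi = 10.0 ** (xi_h1_db / 10.0)
    prior_ratio = (1.0 - prior_h1) / prior_h1
    noise_psd = []
    for y2, sigma2 in zip(periodogram, noise_psd_prev):
        # Posterior speech presence probability from the likelihood ratio
        # of two zero-mean complex Gaussian hypotheses with fixed priors.
        exponent = (y2 / sigma2) * xi / (1.0 + xi)
        spp = 1.0 / (1.0 + prior_ratio * (1.0 + xi) * math.exp(-exponent))
        # Soft decision: weight the noisy periodogram by the speech absence
        # probability and the previous estimate by the presence probability.
        noise_periodogram = (1.0 - spp) * y2 + spp * sigma2
        # First-order recursive smoothing over time.
        noise_psd.append(alpha * sigma2 + (1.0 - alpha) * noise_periodogram)
    return noise_psd


if __name__ == "__main__":
    previous = [1.0, 1.0, 1.0]
    current = [1.0, 2.0, 100.0]  # noise-like, slightly louder, speech-like
    print(spp_noise_update(current, previous))
```

In this sketch a bin whose power is far above the previous noise estimate gets a presence probability close to one, so the estimate is held. A bin only slightly above it is allowed to pull the estimate upward, which is the soft counterpart of the hard VAD-gated update.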
Pages: 1383-1393
Page count: 11
Related papers
30 in total
  • [1] Speech spectral amplitude estimators using optimally shaped Gamma and Chi priors
    Andrianakis, I.
    White, P. R.
    [J]. SPEECH COMMUNICATION, 2009, 51 (01) : 1 - 14
  • [2] [Anonymous], 1988, NAT I STANDARDS THEC
  • [3] Berouti M., 1979, ICASSP 79. 1979 IEEE International Conference on Acoustics, Speech and Signal Processing, P208
  • [4] A novel a priori SNR estimation approach based on selective cepstro-temporal smoothing
    Breithaupt, Colin
    Gerkmann, Timo
    Martin, Rainer
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008 : 4897 - 4900
  • [5] Analysis of the Decision-Directed SNR Estimator for Speech Enhancement With Respect to Low-SNR and Transient Conditions
    Breithaupt, Colin
    Martin, Rainer
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02) : 277 - 289
  • [6] Elimination of the Musical Noise Phenomenon with the Ephraim and Malah Noise Suppressor
    Cappe, Olivier
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02) : 345 - 349
  • [7] Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
    Cohen, I
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05) : 466 - 475
  • [8] Speech enhancement using super-Gaussian speech models and noncausal a priori SNR estimation
    Cohen, I
    [J]. SPEECH COMMUNICATION, 2005, 47 (03) : 336 - 350
  • [9] Speech enhancement for non-stationary noise environments
    Cohen, I
    Berdugo, B
    [J]. SIGNAL PROCESSING, 2001, 81 (11) : 2403 - 2418
  • [10] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR
    EPHRAIM, Y
    MALAH, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) : 443 - 445