Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay

被引:411
作者
Gerkmann, Timo [1 ]
Hendriks, Richard C. [2 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, Speech Signal Proc Grp, D-26111 Oldenburg, Germany
[2] Delft Univ Technol, Signal & Informat Proc Lab, NL-2628 CD Delft, Netherlands
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2012年 / 20卷 / 04期
关键词
Noise power estimation; speech enhancement; SQUARE ERROR ESTIMATION; SPEECH ENHANCEMENT; ESTIMATION ALGORITHM; SNR; GAMMA;
D O I
10.1109/TASL.2011.2180896
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recently, it has been proposed to estimate the noise power spectral density by means of minimum mean-square error (MMSE) optimal estimation. We show that the resulting estimator can be interpreted as a voice activity detector (VAD)-based noise power estimator, where the noise power is updated only when speech absence is signaled, compensated with a required bias compensation. We show that the bias compensation is unnecessary when we replace the VAD by a soft speech presence probability (SPP) with fixed priors. Choosing fixed priors also has the benefit of decoupling the noise power estimator from subsequent steps in a speech enhancement framework, such as the estimation of the speech power and the estimation of the clean speech. We show that the proposed speech presence probability (SPP) approach maintains the quick noise tracking performance of the bias compensated minimum mean-square error (MMSE)-based approach while exhibiting less overestimation of the spectral noise power and an even lower computational complexity.
引用
收藏
页码:1383 / 1393
页数:11
相关论文
共 30 条
  • [11] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR
    EPHRAIM, Y
    MALAH, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06): : 1109 - 1121
  • [12] Minimum mean-square error estimation of discrete fourier coefficients with generalized gamma priors
    Erkelens, Jan S.
    Hendriks, Richard C.
    Heusdens, Richard
    Jensen, Jesper
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (06): : 1741 - 1752
  • [13] Improved a posteriori speech presence probability estimation based on a likelihood ratio with fixed priors
    Gerkmann, Timo
    Breithaupt, Colin
    Martin, Rainer
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (05): : 910 - 919
  • [14] Gerkmann T, 2011, 2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), P145, DOI 10.1109/ASPAA.2011.6082266
  • [15] SPEECH PRESENCE PROBABILITY ESTIMATION BASED ON TEMPORAL CEPSTRUM SMOOTHING
    Gerkmann, Timo
    Krawczyk, Martin
    Martin, Rainer
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4254 - 4257
  • [16] Gradshteyn S., 2014, Table of Integrals, Series, and Products, V8th
  • [17] Noise tracking using DFT domain subspace decompositions
    Hendriks, Richard C.
    Jensen, Jesper
    Heusdens, Richard
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (03): : 541 - 553
  • [18] MMSE BASED NOISE PSD TRACKING WITH LOW COMPLEXITY
    Hendriks, Richard C.
    Heusdens, Richard
    Jensen, Jesper
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4266 - 4269
  • [19] Jang K. W., 2007, C INT SPEECH COMM AS, P2917
  • [20] Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model
    Lotter, T
    Vary, P
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (07) : 1110 - 1126