Speech enhancement for non-stationary noise environments

被引:471
作者
Cohen, I [1 ]
Berdugo, B [1 ]
机构
[1] Lamar Signal Proc Ltd, IL-20692 Yokneam Ilit, Israel
关键词
Noise abatement - Probability density function - Signal to noise ratio - Spectrum analysis - Spurious signal noise;
D O I
10.1016/S0165-1684(01)00128-1
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present an optimally-modified log-spectral amplitude (OM-LSA) speech estimator and a minima controlled recursive averaging (MCRA) noise estimation approach for robust speech enhancement. The spectral gain function, which minimizes the mean-square error of the log-spectra, is obtained as a weighted geometric mean of the hypothetical gains associated with the speech presence uncertainty. The noise estimate is given by averaging past spectral power values, using a smoothing parameter that is adjusted by the speech presence probability in subbands. We introduce two distinct speech presence probability functions, one for estimating the speech and one for controlling the adaptation of the noise spectrum. The former is based on the time-frequency distribution of the a priori signal-to-noise ratio. The latter is determined by the ratio between the local energy of the noisy signal and its minimum within a specified time window. Objective and subjective evaluation under various environmental conditions confirm the superiority of the OM-LSA and MCRA estimators. Excellent noise suppression is achieved, while retaining weak speech components and avoiding the musical residual noise phenomena. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:2403 / 2418
页数:16
相关论文
共 29 条
  • [1] [Anonymous], P IEEE INT C AC SPEE
  • [2] [Anonymous], P EURSIPCO ED UK
  • [3] Elimination of the Musical Noise Phenomenon with the Ephraim and Malah Noise Suppressor
    Cappe, Olivier
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02): : 345 - 349
  • [4] Cohen I., 2001, PROC INT WORKSHOP HA, P95
  • [5] COHEN I, 2001, P 26 IEEE INT C AC S
  • [6] Crochiere R. E., 1983, MULTIRATE DIGITAL SI
  • [7] DOBLINGER G, 1995, P EUR, P1513
  • [8] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR
    EPHRAIM, Y
    MALAH, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02): : 443 - 445
  • [9] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR
    EPHRAIM, Y
    MALAH, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06): : 1109 - 1121
  • [10] Garofolo J. S., 1988, Tech. rep.