Cepstral smoothing of spectral filter gains for speech enhancement without musical noise

Cited by: 50
Authors
Breithaupt, Colin [1]
Gerkmann, Timo [1]
Martin, Rainer [1]
Affiliations
[1] Ruhr Univ Bochum, Inst Commun Acoustics, D-44780 Bochum, Germany
Keywords
cepstral analysis; cepstral smoothing; musical noise; nonstationary noise; smoothing methods; speech enhancement;
DOI
10.1109/LSP.2007.906208
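Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;
Abstract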
Many speech enhancement algorithms that modify short-term spectral magnitudes of the noisy signal by means of adaptive spectral gain functions are plagued by annoying spectral outliers. In this letter, we propose cepstral smoothing as a solution to this problem. We show that cepstral smoothing can effectively prevent spectral peaks of short duration that may be perceived as musical noise. At the same time, cepstral smoothing preserves speech onsets, plosives, and quasi-stationary narrowband structures like voiced speech. The proposed recursive temporal smoothing is applied to higher cepstral coefficients only, excluding those representing the pitch information. As the higher cepstral coefficients describe the finer spectral structure of the Fourier spectrum, smoothing them along time prevents single coefficients of the filter function from changing excessively and independently of their neighboring bins, thus suppressing musical noise. The proposed cepstral smoothing technique is very effective in nonstationary noise.
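The smoothing idea described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `cepstral_smooth_gains`, the parameters `q_low`, `beta_high`, `beta_low`, `pitch_quefrency`, and `floor`, and all default values are assumptions chosen for the example. The pitch quefrency is assumed to be supplied by the caller, and the first frame is passed through unsmoothed as an arbitrary initialization.

```python
import numpy as np

def cepstral_smooth_gains(gains, q_low=4, beta_high=0.9, beta_low=0.0,
                          pitch_quefrency=None, floor=1e-3):
    """Recursively smooth spectral gains along time in the cepstral domain.

    gains: array of shape (n_frames, n_bins), one-sided gains with
           n_bins = n_fft // 2 + 1.
    q_low: quefrency bins below this index are smoothed with beta_low
           (assumed to carry the coarse spectral envelope).
    pitch_quefrency: optional per-frame pitch quefrency bin (int or None)
           that is excluded from strong smoothing (hypothetical input).
    """
    gains = np.asarray(gains, dtype=float)
    n_frames, n_bins = gains.shape
    n_fft = 2 * (n_bins - 1)

    # Per-quefrency smoothing constants, mirrored so the cepstrum stays symmetric.
    beta = np.full(n_fft, beta_high)
    beta[:q_low] = beta_low
    beta[n_fft - q_low + 1:] = beta_low

    smoothed = np.empty_like(gains)
    c_prev = None
    for t in range(n_frames):
        # Cepstrum of the log gain (real and symmetric for a real log spectrum).
        log_g = np.log(np.maximum(gains[t], floor))
        c = np.fft.irfft(log_g, n=n_fft)

        beta_t = beta
        if pitch_quefrency is not None and pitch_quefrency[t] is not None:
            q = int(pitch_quefrency[t])
            beta_t = beta.copy()
            beta_t[[q, (n_fft - q) % n_fft]] = beta_low  # keep pitch peak agile

        # First-order recursive smoothing along time, per cepstral coefficient.
        c_s = c if c_prev is None else beta_t * c_prev + (1.0 - beta_t) * c
        smoothed[t] = np.exp(np.fft.rfft(c_s).real)
        c_prev = c_s
    return smoothed
```

In this sketch an isolated single-bin, single-frame gain peak is strongly attenuated, while a gain pattern that is constant over time passes through unchanged. Setting `beta_high=0.0` disables the smoothing altogether.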
Pages: 1036-1039
Page count: 4
Related papers
9 items in total
  • [1] [Anonymous], P EUR C SPEECH COMM
  • [2] [Anonymous], 1988, NAT I STANDARDS THEC
  • [3] BREITHAUPT C, CEPSTAL SMOOTHING AU
  • [4] BREITHAUPT C, 2006, P INT WORKSH AC ECHO
  • [5] Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
    Ephraim, Y
    Malah, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06): 1109-1121
  • [6] Postprocessing method for suppressing musical noise generated by spectral subtraction
    Goh, Z
    Tan, KC
    Tan, BTG
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (03): 287-292
  • [7] Spectral subtraction using reduced delay convolution and adaptive averaging
    Gustafsson, H
    Nordholm, SE
    Claesson, I
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08): 799-807
  • [8] Tracking speech-presence uncertainty to improve speech enhancement in non-stationary noise environments
    Malah, D
    Cox, RV
    Accardi, AJ
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999: 789-792
  • [9] Noise power spectral density estimation based on optimal smoothing and minimum statistics
    Martin, R
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (05): 504-512