Speech enhancement based on wavelet thresholding the multitaper spectrum

被引:158
作者
Hu, Y [1 ]
Loizou, PC [1 ]
机构
[1] Univ Texas, Dept Elect Engn, Richardson, TX 75083 USA
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2004年 / 12卷 / 01期
基金
美国国家卫生研究院;
关键词
multitaper method; musical noise; power spectrum estimation; speech enhancement; wavelet thresholding;
D O I
10.1109/TSA.2003.819949
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
It is well known that the "musical noise" encountered in most frequency domain speech enhancement algorithms is partially due to the large variance estimates of the spectra. To address this issue, we propose in this paper the use of low-variance spectral estimators based on wavelet thresholding the multitaper spectra for speech enhancement. A short-time spectral amplitude estimator is derived which incorporates the wavelet-thresholded multitaper spectra. Listening tests showed that the use of multitaper spectrum estimation combined with wavelet thresholding suppressed the musical noise and yielded better quality than the subspace and MMSE algorithms.
引用
收藏
页码:59 / 67
页数:9
相关论文
共 37 条
[1]  
[Anonymous], 1995, TRANSLATION INVARIAN
[2]  
[Anonymous], P IEEE INT C AC SPEE
[3]  
[Anonymous], 1993, SPECTRAL ANAL PHYS A, DOI [10.1017/cbo9780511622762, DOI 10.1017/CBO9780511622762, 10.1017/CBO9780511622762]
[4]   Wavelet speech enhancement based on the Teager Energy operator [J].
Bahoura, M ;
Rouat, J .
IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (01) :10-12
[5]  
BARTLETT MS, 1946, J ROY STAT SOC B, V8, P128
[6]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[7]   Elimination of the Musical Noise Phenomenon with the Ephraim and Malah Noise Suppressor [J].
Cappe, Olivier .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02) :345-349
[8]   Multitaper power spectrum estimation and thresholding:: Wavelet packets versus wavelets [J].
Cristán, AC ;
Walden, AT .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2002, 50 (12) :2976-2986
[9]   DE-NOISING BY SOFT-THRESHOLDING [J].
DONOHO, DL .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1995, 41 (03) :613-627
[10]   IDEAL SPATIAL ADAPTATION BY WAVELET SHRINKAGE [J].
DONOHO, DL ;
JOHNSTONE, IM .
BIOMETRIKA, 1994, 81 (03) :425-455