Speech signal enhancement through adaptive wavelet thresholding

被引:72
作者
Johnson, Michael T.
Yuan, Xiaolong
Ren, Yao
机构
[1] Marquette Univ, Dept Elect & Comp Engn, Milwaukee, WI 53233 USA
[2] Motorola Elect Ltd, Beijing 100022, Peoples R China
关键词
adaptive wavelets; bionic wavelet transform; speech enhancement; denoising; DECOMPOSITION; TRANSFORM;
D O I
10.1016/j.specom.2006.12.002
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper demonstrates the application of the Bionic Wavelet Transform (BWT), an adaptive wavelet transform derived from a non-linear auditory model of the cochlea, to the task of speech signal enhancement. Results, measured objectively by Signal-to-Noise ratio (SNR) and Segmental SNR (SSNR) and subjectively by Mean Opinion Score (MOS), are given for additive white Gaussian noise as well as four different types of realistic noise environments. Enhancement is accomplished through the use of thresholding on the adapted BWT coefficients, and the results are compared to a variety of speech enhancement techniques, including Ephraim Malah filtering, iterative Wiener filtering, and spectral subtraction, as well as to wavelet denoising based on a perceptually scaled wavelet packet transform decomposition. Overall results indicate that SNR and SSNR improvements for the proposed approach are comparable to those of the Ephraim Malah filter, with BWT enhancement giving the best results of all methods for the noisiest (-10 db and -5 db input SNR) conditions. Subjective measurements using MOS surveys across a variety of 0 db SNR noise conditions indicate enhancement quality competitive with but still lower than results for Ephraim Malah filtering and iterative Wiener filtering, but higher than the perceptually scaled wavelet method. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:123 / 133
页数:11
相关论文
共 29 条
[1]  
Antoniadis A., 1995, Wavelets and Statistics
[2]   Wavelet speech enhancement based on the Teager Energy operator [J].
Bahoura, M ;
Rouat, J .
IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (01) :10-12
[3]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[4]   Speech enhancement using perceptual wavelet packet decomposition and teager energy operator [J].
Chen, SH ;
Wang, JF .
JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2004, 36 (2-3) :125-139
[5]  
COHEN I, 2001, EUR 2001 DENM
[6]  
Daubechies I., 1992, 10 LECT WAVELETS, DOI [10.1137/1.9781611970104, DOI 10.1137/1.9781611970104]
[7]  
Debnath L., 2002, Wavelet transforms and their applications, DOI [10.1007/978-1-4612-0097-0, DOI 10.1007/978-1-4612-0097-0]
[8]  
Deller JR, 2000, DISCRETE TIME PROCES, DOI DOI 10.1109/9780470544402.CH11
[9]   DE-NOISING BY SOFT-THRESHOLDING [J].
DONOHO, DL .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1995, 41 (03) :613-627
[10]   Adapting to unknown smoothness via wavelet shrinkage [J].
Donoho, DL ;
Johnstone, IM .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (432) :1200-1224