Perceptual wavelet-representation of speech signals and its application to speech enhancement

被引:18
作者
Pinter, I
机构
[1] GAMF Technical College, Department of Informatics, H-6000 Kecskemét
关键词
D O I
10.1006/csla.1996.0001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When considering the speech as a non-stationary signal and giving up the quasi-stationary approach, the wavelet transform is a possible analysis method. Though this technique has been applied to a wide variety of speech processing tasks, the problem of how to derive the speech-tailored wavelets is not solved definitely. Therefore, in this paper the so-called perceptual wavelet transform and the corresponding speech representation are introduced as possible solutions. Although the proposed transform has been derived heuristically-namely, to be optimal in the perceptual frequency scale in Gabor-sense and to perform a 1 CB speech analysis-it appears that this is a self-invertible, overcomplete, shiftable transform. This is an important fact in the light of recent results in the so-called wavelet-frame-based denoising; moreover, the concept of soft-thresholding of the latter can also be introduced heuristically by applying the auditory masking phenomenon. The recent paper describes the perceptual wavelet functions in detail, presents the properties of this method in a time-domain analysis example, and two novel speech representations are given, which are somewhat similar to each other and to the conventional spectrogram. Finally, a new speech enhancement method is proposed, which consists of three stages: a perceptual wavelet-decomposition, followed by a compressive non-linearity (with adjustable parameters to the noise process), and summing. (C) 1996 Academic Press Limited.
引用
收藏
页码:1 / 22
页数:22
相关论文
共 52 条
[1]  
AMBIKAIRAJAH E, 1993, SEP P C EUR 93, P151
[2]  
[Anonymous], P INT C AC SPEECH SI
[3]  
BASILE P, 1993, VISUAL REPRESENTATIO
[4]  
BOFF KR, 1986, HDB PERCEPTION HUMAN, V2, P27
[5]   THE SCALE REPRESENTATION [J].
COHEN, L .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1993, 41 (12) :3275-3292
[6]  
DALESSANDRO C, 1993, VISUAL REPRESENTATIO
[7]   THE WAVELET TRANSFORM, TIME-FREQUENCY LOCALIZATION AND SIGNAL ANALYSIS [J].
DAUBECHIES, I .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1990, 36 (05) :961-1005
[8]  
DAUBECHIES I, 1990, WAVELETS
[9]  
DERMODY P, 1993, VISUAL REPRESENTATIO
[10]   WAVELET ANALYSIS IN RECRUITMENT OF LOUDNESS COMPENSATION [J].
DRAKE, LA ;
RUTLEDGE, JC ;
COHEN, J .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1993, 41 (12) :3306-3312