Artificial neural networks for nonlinear time-domain filtering of speech

Cited by: 7
Authors
Le, TT
Mason, JS
Affiliation
[1] Department of Electrical and Electronic Engineering, University of Wales Swansea, Swansea
Source
IEE Proceedings - Vision, Image and Signal Processing | 1996 / Vol. 143 / No. 3
Keywords
CELP coding; nonlinear time-domain filtering; speech enhancement; multilayer perceptron
DOI
10.1049/ip-vis:19960447
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
A multilayer perceptron (MLP) is applied as a time-domain nonlinear filter to two classes of degraded speech, namely speech with additive Gaussian white noise and speech with the nonlinear system degradation introduced by a low bit-rate CELP coder. The goal of the study is to examine the influence of the inherent nonlinearity within the MLP, and this is achieved by varying the levels of nonlinearity within the structure. Direct comparisons of MLPs and linear filters show that with CELP degradation the SNR improvement achieved by the MLP is measurably better than that of an equivalent linear structure (3 dB cf. 1.5 dB), but when the degradation is additive noise the two structures perform equally well. The study highlights the importance of scaling to achieve optimum performance, and of matching the enhancer to the degradation.
Pages: 149-154
Number of pages: 6
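
The abstract compares an MLP with a linear filter by SNR improvement and mentions varying the level of nonlinearity. The record does not say how either was done, so the following is only a minimal sketch under stated assumptions: SNR is taken as clean-signal power over error power, the filter maps a window of input samples to one output sample through a single tanh hidden layer, and `gain` is a hypothetical knob (not from the paper) that makes the hidden units nearly linear as it approaches zero. All function and parameter names are illustrative.

```python
import math


def snr_db(clean, estimate):
    """SNR in dB of `estimate` measured against the `clean` reference."""
    signal_power = sum(c * c for c in clean)
    error_power = sum((c - e) ** 2 for c, e in zip(clean, estimate))
    return 10.0 * math.log10(signal_power / error_power)


def snr_improvement_db(clean, degraded, enhanced):
    """Output SNR minus input SNR, in dB (assumed definition)."""
    return snr_db(clean, enhanced) - snr_db(clean, degraded)


def mlp_filter_sample(window, hidden_weights, output_weights, gain=1.0):
    """Estimate one output sample from a window of input samples.

    Each hidden unit computes tanh(gain * a) / gain, where a is its
    weighted sum of the window. For small `gain` this tends to a, so the
    whole structure tends to a linear FIR filter (hypothetical way of
    varying the nonlinearity; biases are omitted for brevity).
    """
    hidden = [
        math.tanh(gain * sum(w * x for w, x in zip(row, window))) / gain
        for row in hidden_weights
    ]
    return sum(v * h for v, h in zip(output_weights, hidden))


if __name__ == "__main__":
    clean = [1.0, -1.0, 1.0, -1.0]
    degraded = [1.5, -1.5, 1.5, -1.5]
    enhanced = [1.25, -1.25, 1.25, -1.25]
    print(snr_improvement_db(clean, degraded, enhanced))
```

Because tanh saturates for large inputs, the amplitude of the samples in `window` relative to the weights determines how nonlinear the filter behaves in this sketch, which is one plausible reading of why the abstract stresses scaling.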