Noisy speech processing by recurrently adaptive fuzzy filters

被引:23
作者
Juang, CF [1 ]
Lin, CT
机构
[1] Chung Chou Inst Technol, Dept Elect Engn, Changhua, Taiwan
[2] Natl Chiao Tung Univ, Dept Elect & Control Engn, Hsinchu 300, Taiwan
关键词
adaptive noise cancellation; noisy speech recognition; real-time recurrent learning; structure identification;
D O I
10.1109/91.917120
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Two noisy speech processing problems-speech enhancement and noisy speech recognition-are dealt with in this paper. The technique we focus on is by using the filtering approach; a novel filter, the recurrently adaptive fuzzy filter (RAFF), is proposed and applied to these two problems. The speech enhancement is based on adaptive noise cancellation with two microphones, where the RAFF is used to eliminate the noise corrupting the desired speech signal in the primary channel, As to the noisy speech recognition, the RAFF is used to filter the noise in the feature domain of speech signals. The RAFF is inherently a recurrent multilayered connectionist network for realizing the basic elements and functions of dynamic fuzzy inference, and may be considered to be constructed from a series of dynamic fuzzy rules. As compared to other existing nonlinear filters, three major advantages of the RAFF are observed: 1) a priori knowledge can be incorporated into the RAFF, which makes the fusion of numerical data and linguistic information possible; 2) owing to the dynamic property of the RAFF, the exact lagged order of the input variables need not be known in advance; 3) no predetermination, like the number of hidden nodes, must be given since the RAFF can find its optimal structure and parameters automatically Several examples on adaptive noise cancellation and noisy speech recognition problems using the RAFF are illustrated to demonstrate the performance of the RAFF.
引用
收藏
页码:139 / 152
页数:14
相关论文
共 37 条
[1]   THE USE OF NEURAL NETS TO COMBINE EQUALIZATION WITH DECODING FOR SEVERE INTERSYMBOL INTERFERENCE CHANNELS [J].
ALMASHOUQ, KA ;
REED, IS .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (06) :982-988
[2]  
[Anonymous], 1987, Adaptive digital filtering and signal analysis
[3]   RECURRENT RADIAL BASIS FUNCTION NETWORKS FOR ADAPTIVE NOISE CANCELLATION [J].
BILLINGS, SA ;
FUNG, CF .
NEURAL NETWORKS, 1995, 8 (02) :273-290
[4]  
Coker M. J., 1980, ICASSP 80 Proceedings. IEEE International Conference on Acoustics, Speech and Signal Processing, P470
[5]   FINDING STRUCTURE IN TIME [J].
ELMAN, JL .
COGNITIVE SCIENCE, 1990, 14 (02) :179-211
[6]   THE APPLICATION OF NONLINEAR STRUCTURES TO THE RECONSTRUCTION OF BINARY SIGNALS [J].
GIBSON, GJ ;
SIU, S ;
COWAN, CFN .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (08) :1877-1884
[7]   SPEECH RECOGNITION IN NOISY ENVIRONMENTS - A SURVEY [J].
GONG, YF .
SPEECH COMMUNICATION, 1995, 16 (03) :261-291
[8]  
GORRINI V, 1994, PROCEEDINGS OF THE THIRD IEEE CONFERENCE ON FUZZY SYSTEMS - IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, VOLS I-III, P193, DOI 10.1109/FUZZY.1994.343687
[9]   FUNCTIONAL EQUIVALENCE BETWEEN RADIAL BASIS FUNCTION NETWORKS AND FUZZY INFERENCE SYSTEMS [J].
JANG, JSR ;
SUN, CT .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1993, 4 (01) :156-159
[10]  
Jordan MI, 1986, Cogn Sci, P531