Hierarchical singleton-type recurrent neural fuzzy networks for noisy speech recognition

被引:39
作者
Juang, Chia-Feng [1 ]
Chiou, Chyi-Tian [1 ]
Lai, Chun-Lung [1 ]
机构
[1] Natl Chung Hsing Univ, Dept Elect Engn, Taichung 402, Taiwan
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2007年 / 18卷 / 03期
关键词
hierarchical networks; neural filters; neural fuzzy networks; noisy speech filtering; recurrent neural networks;
D O I
10.1109/TNN.2007.891194
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes noisy speech recognition using hierarchical singleton-type recurrent neural fuzzy networks (HSRNFNs). The proposed HSRNFN is a hierarchical connection of two singleton-type recurrent neural fuzzy networks (SRNFNs), where one is used for noise filtering and the other for recognition. The SRNFN is constructed by recurrent fuzzy if-then rules with fuzzy singletons in the consequences, and their recurrent properties make them suitable for processing speech patterns with temporal characteristics. Inn, words recognition, 77, SRNFNs are created for modeling n words, where each SRNFN receives the current frame feature and predicts the next one of its modeling word. The prediction error of each SRNFN is used as recognition criterion. In filtering, one SRNFN is created, and each SRNFN recognizer is connected to the same SRNFN filter, which filters noisy speech patterns in the feature domain before feeding them to the SRNFN recognizer. Experiments with Mandarin word recognition under different types of noise are performed. Other recognizers, including multilayer perceptron (MLP), time-delay neural networks (TDNNs), and hidden Markov models (HMMs), are also tested and compared. These experiments and comparisons demonstrate good results with HSRNFN for noisy speech recognition tasks.
引用
收藏
页码:833 / 843
页数:11
相关论文
共 28 条
[1]  
BOURLARD HA, 1994, CONNECTIONIST SPEECH, pCH7
[2]  
*CAMBR U ENG DEP, HIDD MARK MOD TOOLK
[3]  
GAO Y, 1994, IEEE INT C AC SPEECH, V2, P89
[4]  
Gillick L., 1989, ICASSP-89: 1989 International Conference on Acoustics, Speech and Signal Processing (IEEE Cat. No.89CH2673-2), P532, DOI 10.1109/ICASSP.1989.266481
[5]   SPEECH RECOGNITION IN NOISY ENVIRONMENTS - A SURVEY [J].
GONG, YF .
SPEECH COMMUNICATION, 1995, 16 (03) :261-291
[6]  
GORRINI V, 1994, PROCEEDINGS OF THE THIRD IEEE CONFERENCE ON FUZZY SYSTEMS - IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, VOLS I-III, P193, DOI 10.1109/FUZZY.1994.343687
[7]   A TSK-type recurrent fuzzy network for dynamic systems processing by neural network and genetic algorithms [J].
Juang, CF .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2002, 10 (02) :155-170
[8]   Noisy speech processing by recurrently adaptive fuzzy filters [J].
Juang, CF ;
Lin, CT .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2001, 9 (01) :139-152
[9]   Identification and control of dynamic systems using recurrent fuzzy neural networks [J].
Lee, CH ;
Teng, CC .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2000, 8 (04) :349-366
[10]  
LEVIN E, 1990, INT CONF ACOUST SPEE, P433, DOI 10.1109/ICASSP.1990.115740