From hybrid adjustable neuro-fuzzy systems to adaptive connectionist-based systems for phoneme and word recognition

被引:4
作者
Kasabov, NK [1 ]
Kilgour, RI [1 ]
Sinclair, SJ [1 ]
机构
[1] Univ Otago, Dept Informat Sci, Dunedin, New Zealand
关键词
pattern recognition; artificial intelligence; neural networks; speech recognition;
D O I
10.1016/S0165-0114(98)00233-4
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper discusses the problem of adaptation in automatic speech recognition systems (ASRS) and suggests several strategies for adaptation in a modular architecture for speech recognition. The architecture allows for adaptation at different levels of the recognition process, where modules can be adapted individually based on their performance and the performance of the whole system. Two realisations of this architecture are presented along with experimental results from small-scale experiments. The first realisation is a hybrid system for speaker-independent phoneme-based spoken word recognition, consisting of neural net-works for recognising English phonemes and fuzzy systems for modelling acoustic and linguistic knowledge. This system is adjustable by additional training of individual neural network modules and tuning the fuzzy systems. The increased accuracy of the recognition through appropriate adjustment is also discussed. The second realisation of the architecture is a connectionist system that uses fuzzy neural networks FuNNs to accommodate both a prior linguistic knowledge and data from a speech corpus. A method for on-line adaptation of FuNNs is also presented. (C) 1999 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:349 / 367
页数:19
相关论文
共 37 条
[21]  
KASABOV NK, 1998, P IIZ 98 C IIZ JAP
[22]  
KILGOUR RI, 1996, THESIS U OTAGO
[23]   Distinct cortical areas associated with native and second languages [J].
Kim, KHS ;
Relkin, NR ;
Lee, KM ;
Hirsch, J .
NATURE, 1997, 388 (6638) :171-174
[24]   EVALUATION AND INTEGRATION OF VISUAL AND AUDITORY INFORMATION IN SPEECH-PERCEPTION [J].
MASSARO, DW ;
COHEN, MM .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1983, 9 (05) :753-771
[25]   FUZZY MULTILAYER PERCEPTRON, INFERENCING AND RULE GENERATION [J].
MITRA, S ;
PAL, SK .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (01) :51-63
[26]  
MORGAN CS, 1991, NEURAL NETWORKS SPEE
[27]  
PAL N, 1997, CONNECTIONIST BASED, P221
[28]   APPLICATIONS OF VOICE PROCESSING TO TELECOMMUNICATIONS [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1994, 82 (02) :199-228
[29]  
ROBINSON D, 1988, ARTIFICIAL INTELLIGE
[30]  
Rummery G., 1994, ON LINE Q LEARNING U