From hybrid adjustable neuro-fuzzy systems to adaptive connectionist-based systems for phoneme and word recognition

被引:4
作者
Kasabov, NK [1 ]
Kilgour, RI [1 ]
Sinclair, SJ [1 ]
机构
[1] Univ Otago, Dept Informat Sci, Dunedin, New Zealand
关键词
pattern recognition; artificial intelligence; neural networks; speech recognition;
D O I
10.1016/S0165-0114(98)00233-4
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper discusses the problem of adaptation in automatic speech recognition systems (ASRS) and suggests several strategies for adaptation in a modular architecture for speech recognition. The architecture allows for adaptation at different levels of the recognition process, where modules can be adapted individually based on their performance and the performance of the whole system. Two realisations of this architecture are presented along with experimental results from small-scale experiments. The first realisation is a hybrid system for speaker-independent phoneme-based spoken word recognition, consisting of neural net-works for recognising English phonemes and fuzzy systems for modelling acoustic and linguistic knowledge. This system is adjustable by additional training of individual neural network modules and tuning the fuzzy systems. The increased accuracy of the recognition through appropriate adjustment is also discussed. The second realisation of the architecture is a connectionist system that uses fuzzy neural networks FuNNs to accommodate both a prior linguistic knowledge and data from a speech corpus. A method for on-line adaptation of FuNNs is also presented. (C) 1999 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:349 / 367
页数:19
相关论文
共 37 条
[1]  
AMARI S, 1997, BRAIN LIKE COMPUTING
[2]  
[Anonymous], P 5 IFSA WORLD C IFS
[3]  
[Anonymous], 1997, PERCEIVING TALKING F
[4]  
[Anonymous], 1989, GENETIC ALGORITHMS S
[5]  
CLARK CY, 1990, INTRO PHONETICS PHON
[6]   THE CHALLENGE OF SPOKEN LANGUAGE SYSTEMS - RESEARCH DIRECTIONS FOR THE NINETIES [J].
COLE, R ;
HIRSCHMAN, L ;
ATLAS, L ;
BECKMAN, M ;
BIERMANN, A ;
BUSH, M ;
CLEMENTS, M ;
COHEN, J ;
GARCIA, O ;
HANSON, B ;
HERMANSKY, H ;
LEVINSON, S ;
MCKEOWN, K ;
MORGAN, N ;
NOVICK, DG ;
OSTENDORF, M ;
OVIATT, S ;
PRICE, P ;
SILVERMAN, H ;
SPITZ, J ;
WAIBEL, A ;
WEINSTEIN, C ;
ZAHORIAN, S ;
ZUE, V .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01) :1-21
[7]  
FU LM, 1989, P 1 IEEE INT C ART N, P221
[8]  
Huo Q, 1996, INT CONF ACOUST SPEE, P705, DOI 10.1109/ICASSP.1996.543218
[9]   ANFIS - ADAPTIVE-NETWORK-BASED FUZZY INFERENCE SYSTEM [J].
JANG, JSR .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1993, 23 (03) :665-685
[10]  
Kasabov N., 1995, Proceedings. 1995. Second New Zealand International Two-Stream Conference on Artificial Neural Networks and Expert Systems, P294, DOI 10.1109/ANNES.1995.499492