GLOBAL OPTIMIZATION OF A NEURAL NETWORK-HIDDEN MARKOV MODEL HYBRID

Cited by: 91
Authors
BENGIO, Y
DEMORI, R
FLAMMIA, G
KOMPE, R
Affiliations
[1] UNIV AALBORG, CTR SPEECH TECHNOL, AALBORG, DENMARK
[2] MCGILL UNIV, SCH COMP SCI, MONTREAL H3A 2A7, QUEBEC, CANADA
[3] UNIV ERLANGEN NURNBERG, INST PATTERN RECOGNIT, W-8520 ERLANGEN, GERMANY
Source
IEEE TRANSACTIONS ON NEURAL NETWORKS | 1992 / Vol. 3 / No. 2
DOI
10.1109/72.125866
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The subject of this paper is the integration of multilayered and recurrent artificial neural networks (ANN's) with hidden Markov models (HMM's). ANN's are suitable for approximating functions that compute new acoustic parameters, whereas HMM's have been proven successful at modeling the temporal structure of the speech signal. In the approach described here, the ANN outputs constitute the sequence of observation vectors for the HMM. An algorithm is proposed for global optimization of all the parameters. Results on speaker-independent recognition experiments using this integrated ANN-HMM system on the TIMIT continuous speech data base are reported.
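The data flow described in the abstract (ANN outputs used as the HMM's observation sequence) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: a single tanh hidden layer, diagonal-covariance Gaussian emissions, a log-domain forward algorithm, and all function and parameter names. Only the forward computation of the sequence log-likelihood is shown; global optimization in the abstract's sense would adjust both the ANN and HMM parameters with respect to such a criterion, which this sketch does not implement.

```python
import math
import random


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) for a list of finite floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def ann_forward(frame, W1, b1, W2, b2):
    """Map one acoustic frame to an observation vector (one tanh hidden layer, assumed)."""
    hidden = [math.tanh(sum(w * x for w, x in zip(row, frame)) + b)
              for row, b in zip(W1, b1)]
    return [sum(w * h for w, h in zip(row, hidden)) + b
            for row, b in zip(W2, b2)]


def log_gaussian(obs, mean, var):
    """Log density of obs under a diagonal-covariance Gaussian (assumed emission model)."""
    return -0.5 * sum((o - m) ** 2 / v + math.log(2 * math.pi * v)
                      for o, m, v in zip(obs, mean, var))


def hmm_log_likelihood(observations, log_pi, log_A, means, variances):
    """Forward algorithm in the log domain; returns log P(observations | HMM)."""
    n = len(log_pi)
    log_b = [[log_gaussian(o, means[s], variances[s]) for s in range(n)]
             for o in observations]
    log_alpha = [log_pi[s] + log_b[0][s] for s in range(n)]
    for t in range(1, len(observations)):
        log_alpha = [
            log_b[t][j] + logsumexp([log_alpha[i] + log_A[i][j] for i in range(n)])
            for j in range(n)
        ]
    return logsumexp(log_alpha)


def hybrid_log_likelihood(frames, ann_params, hmm_params):
    """ANN outputs form the observation sequence that the HMM scores."""
    observations = [ann_forward(f, *ann_params) for f in frames]
    return hmm_log_likelihood(observations, *hmm_params)


if __name__ == "__main__":
    random.seed(0)
    # Hypothetical sizes: 3-dim frames, 4 hidden units, 2-dim observations, 2 states.
    W1 = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(4)]
    b1 = [0.0] * 4
    W2 = [[random.uniform(-1, 1) for _ in range(4)] for _ in range(2)]
    b2 = [0.0] * 2
    log_pi = [math.log(0.5), math.log(0.5)]
    log_A = [[math.log(0.9), math.log(0.1)], [math.log(0.2), math.log(0.8)]]
    means = [[0.0, 0.0], [1.0, -1.0]]
    variances = [[1.0, 1.0], [1.0, 1.0]]
    frames = [[random.gauss(0, 1) for _ in range(3)] for _ in range(5)]
    print(hybrid_log_likelihood(frames, (W1, b1, W2, b2),
                                (log_pi, log_A, means, variances)))
```

Because the log-likelihood is a composition of the network and the forward recursion, it depends on the ANN weights as well as on the HMM parameters, which is what makes joint training of the two components possible in principle.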
Pages: 252-259
Page count: 8