Connectionist Probability Estimators in HMM Speech Recognition

被引:102
作者
Renals, Steve [1 ]
Morgan, Nelson [1 ]
Bourlard, Herve [2 ]
Cohen, Michael [3 ]
Franco, Horacio [3 ]
机构
[1] Int Comp Sci Inst, Berkeley, CA 94704 USA
[2] Lernout & Hauspie Speechprod, Ieper, Belgium
[3] SRI Int, Menlo Pk, CA 94025 USA
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1994年 / 2卷 / 01期
关键词
D O I
10.1109/89.260359
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We are concerned with integrating connectionist networks into a hidden Markov model (HMM) speech recognition system. This is achieved through a statistical interpretation of connectionist networks as probability estimators. We review the basis of HMM speech recognition and point out the possible benefits of incorporating connectionist networks. Issues necessary to the construction of a connectionist HMM recognition system are discussed, including choice of connectionist probability estimator. We describe the performance of such a system using a multilayer perceptron probability estimator evaluated on the speaker-independent DARPA Resource Management database. In conclusion, we show that a connectionist component improves a state-of-the-art HMM system.
引用
收藏
页码:161 / 174
页数:14
相关论文
共 73 条
[31]   PERCEPTUAL LINEAR PREDICTIVE (PLP) ANALYSIS OF SPEECH [J].
HERMANSKY, H .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 87 (04) :1738-1752
[32]  
Hertz J., 1991, INTRO THEORY NEURAL
[33]   LEARNING ALGORITHMS AND PROBABILITY-DISTRIBUTIONS IN FEEDFORWARD AND FEEDBACK NETWORKS [J].
HOPFIELD, JJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (23) :8429-8433
[34]  
HUANG W, 1988, P IEEE INT C ACOUSTI, P99
[35]  
ISO K, 1990, INT CONF ACOUST SPEE, P441, DOI 10.1109/ICASSP.1990.115744
[36]  
Jelinek F., 1969, IBM Journal of Research and Development, V13, P675, DOI 10.1147/rd.136.0675
[37]   MIXTURE AUTOREGRESSIVE HIDDEN MARKOV-MODELS FOR SPEECH SIGNALS [J].
JUANG, BH ;
RABINER, LR .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (06) :1404-1413
[38]  
KOHONEN T, 1988, P IEEE INT C NEUR NE, V1, P61
[39]  
LEE KF, 1988, THESIS CARNEGIE MELL
[40]  
LEVIN E, 1990, INT CONF ACOUST SPEE, P433, DOI 10.1109/ICASSP.1990.115740