An algorithm for maximum likelihood estimation of hidden Markov models with unknown state-tying

被引:9
作者
Cappe, O [1 ]
Mokbel, CE [1 ]
Jouvet, D [1 ]
Moulines, E [1 ]
机构
[1] ENST Dept Signal, CNRS, URA 820, F-75634 Paris 13, France
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1998年 / 6卷 / 01期
关键词
expectation-maximization algorithm; hidden Markov models; maximum likelihood estimation; speech recognition;
D O I
10.1109/89.650312
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
For speech recognition based on hidden Markov modeling, parameter-tying, which consists in constraining some of the parameters of the model to share the same value, has emerged as a standard practice. In this paper, an original algorithm is proposed that makes it possible to jointly estimate both the shared model parameters and the tying characteristics; using the maximum likelihood criterion, The proposed algorithm is based on a recently introduced extension of the classic expectation-maximization (EM) framework. The convergence properties of this class of algorithms are analyzed in detail. The method is evaluated on an isolated word recognition task using hidden Markov models (HMM's) with Gaussian observation densities and tying at the state level. Finally, the extension of this method to the case of mixture observation densities with tying at the mixture component level is discussed.
引用
收藏
页码:61 / 70
页数:10
相关论文
共 33 条
[21]   ALGORITHM FOR VECTOR QUANTIZER DESIGN [J].
LINDE, Y ;
BUZO, A ;
GRAY, RM .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1980, 28 (01) :84-95
[22]  
Luenberger D.G., 1984, LINEAR NONLINEAR PRO
[23]   ON THE RATE OF CONVERGENCE OF THE ECM ALGORITHM [J].
MENG, XL .
ANNALS OF STATISTICS, 1994, 22 (01) :326-339
[24]  
MENG XL, 1993, BIOMETRIKA, V80, P267, DOI 10.2307/2337198
[25]   MAXIMUM-LIKELIHOOD HIDDEN MARKOV MODELING USING A DOMINANT SEQUENCE OF STATES [J].
MERHAV, N ;
EPHRAIM, Y .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (09) :2111-2115
[26]  
Rabiner L., 1993, Fundamentals of Speech Recognition
[27]   MIXTURE DENSITIES, MAXIMUM-LIKELIHOOD AND THE EM ALGORITHM [J].
REDNER, RA ;
WALKER, HF .
SIAM REVIEW, 1984, 26 (02) :195-237
[28]   OPERATIONAL AND EXPERIMENTAL FRENCH TELECOMMUNICATION SERVICES USING CNET SPEECH RECOGNITION AND TEXT-TO-SPEECH SYNTHESIS [J].
SORIN, C ;
JOUVET, D ;
GAGNOULET, C ;
DUBOIS, D ;
SADEK, D ;
TOULARHOAT, M .
SPEECH COMMUNICATION, 1995, 17 (3-4) :273-286
[29]  
TAKAHASHI S, 1995, P ICASSP, P520
[30]  
WATANABE T, 1994, P INT C SPEECH LANG, P223