An algorithm for maximum likelihood estimation of hidden Markov models with unknown state-tying

Cited by: 9
Authors
Cappe, O [1]
Mokbel, CE [1]
Jouvet, D [1]
Moulines, E [1]
Affiliations
[1] ENST Dept Signal, CNRS, URA 820, F-75634 Paris 13, France
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1998 / Vol. 6 / No. 01
Keywords
expectation-maximization algorithm; hidden Markov models; maximum likelihood estimation; speech recognition;
DOI
10.1109/89.650312
Chinese Library Classification number
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
For speech recognition based on hidden Markov modeling, parameter-tying, which consists in constraining some of the parameters of the model to share the same value, has emerged as a standard practice. In this paper, an original algorithm is proposed that makes it possible to jointly estimate both the shared model parameters and the tying characteristics using the maximum likelihood criterion. The proposed algorithm is based on a recently introduced extension of the classic expectation-maximization (EM) framework. The convergence properties of this class of algorithms are analyzed in detail. The method is evaluated on an isolated word recognition task using hidden Markov models (HMM's) with Gaussian observation densities and tying at the state level. Finally, the extension of this method to the case of mixture observation densities with tying at the mixture component level is discussed.
Pages: 61-70
Page count: 10