An improved maximum model distance approach for HMM-based speech recognition systems

Cited by: 11
Authors
He, QH [1]
Kwong, S [1]
Man, KF [1]
Tang, KS [1]
Affiliations
[1] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China
DOI
10.1016/S0031-3203(99)00144-2
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes an improved maximum model distance (IMMD) approach for HMM-based speech recognition systems, building on our previous work [S. Kwong, Q.H. He, K.F. Man, K.S. Tang, A maximum model distance approach for HMM-based speech recognition, Pattern Recognition 31 (3) (1998) 219-229]. The approach introduces a more realistic definition of model distance for HMM training and uses the limited training data more effectively. Discriminative information contained in the training data is used to improve the performance of the recognizer. The HMM parameter adjustment rules are derived in detail. Theoretical and practical issues concerning the approach are also discussed and investigated. Both isolated-word and continuous speech recognition experiments showed that IMMD achieves a significant error reduction compared with the maximum model distance (MMD) criterion and with other training methods based on the minimum classification error (MCE) and maximum mutual information (MMI) approaches. (C) 2000 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
Pages: 1749-1758
Page count: 10