Genones: Generalized mixture tying in continuous hidden Markov model-based speech recognizers

Cited by: 62
Authors
Digalakis, VV [1]
Monaco, P [1]
Murveit, H [1]
Affiliations
[1] SRI INT, SPEECH TECHNOL & RES LAB, MENLO PK, CA 94025 USA
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1996 / Vol. 4 / No. 4
DOI
10.1109/89.506931
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
An algorithm is proposed that achieves a good tradeoff between modeling resolution and robustness by using a new, general scheme for tying of mixture components in continuous mixture-density hidden Markov model (HMM)-based speech recognizers. The sets of HMM states that share the same mixture components are determined automatically using agglomerative clustering techniques. Experimental results on ARPA's Wall Street Journal corpus show that this scheme reduces errors by 25% over typical tied-mixture systems. New fast algorithms for computing Gaussian likelihoods (the most time-consuming aspect of continuous-density HMM systems) are also presented. These new algorithms significantly reduce the number of Gaussian densities that are evaluated with little or no impact on speech recognition accuracy.
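As an illustration of the agglomerative clustering step mentioned in the abstract, the following is a minimal Python sketch of bottom-up grouping of HMM states into sets that would share mixture components. The abstract does not state the clustering criterion, so the merge cost used here (the count-weighted increase in entropy of the pooled mixture-weight distributions) is an assumption made for illustration, as are the function names, the toy weights, and the occupancy counts.

```python
import math
from itertools import combinations

# Each cluster is a tuple: (mixture_weights, occupancy_count, member_state_indices).
# All names and the distance measure are illustrative assumptions, not taken from the paper.

def entropy(p):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(x * math.log(x) for x in p if x > 0.0)

def merge(a, b):
    """Pool two clusters using a count-weighted average of their weight vectors."""
    wa, na, sa = a
    wb, nb, sb = b
    n = na + nb
    w = [(na * x + nb * y) / n for x, y in zip(wa, wb)]
    return (w, n, sa + sb)

def merge_cost(a, b):
    """Count-weighted entropy increase caused by merging a and b (assumed criterion)."""
    m = merge(a, b)
    return m[1] * entropy(m[0]) - a[1] * entropy(a[0]) - b[1] * entropy(b[0])

def cluster_states(weights, counts, n_clusters):
    """Greedily merge the cheapest pair until n_clusters remain.

    Returns a sorted list of sorted state-index lists, one per cluster.
    """
    clusters = [(list(w), float(n), [i])
                for i, (w, n) in enumerate(zip(weights, counts))]
    while len(clusters) > n_clusters:
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda ij: merge_cost(clusters[ij[0]], clusters[ij[1]]))
        merged = merge(clusters[i], clusters[j])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return sorted(sorted(c[2]) for c in clusters)

# Toy example: four states with weights over three shared Gaussians.
toy_weights = [[0.9, 0.1, 0.0], [0.8, 0.2, 0.0], [0.0, 0.1, 0.9], [0.0, 0.2, 0.8]]
toy_counts = [100, 120, 80, 90]
print(cluster_states(toy_weights, toy_counts, 2))  # -> [[0, 1], [2, 3]]
```

In this sketch, states whose weight distributions concentrate on the same Gaussians are merged first, since pooling them raises the entropy very little. Each resulting group of states would then share one set of mixture components.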
Pages: 281-289
Page count: 9