CONTEXT-DEPENDENT PHONETIC HIDDEN MARKOV-MODELS FOR SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION

Cited by: 120
Author
LEE, KF
Affiliation
[1] School of Computer Science, Carnegie-Mellon University, Pittsburgh
Source
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING | 1990 / Vol. 38 / Issue 04
Funding
U.S. National Science Foundation;
DOI
10.1109/29.52701
Chinese Library Classification (CLC) number
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
The effectiveness of context-dependent phone modeling for speaker-dependent continuous speech recognition has recently been demonstrated. In this study, we apply context-dependent phone models to speaker-independent continuous speech recognition, and show that they are equally effective in this domain. In addition to evaluating several previously proposed context-dependent models, we also introduce two new context-dependent phonetic units: 1) function-word-dependent phone models, which focus on the most difficult subvocabulary, and 2) generalized triphones, which combine similar triphones together based on an information-theoretic measure. The subword clustering procedure used for generalized triphones can find the optimal number of models given a fixed amount of training data. We demonstrate that context-dependent modeling reduces the error rate by as much as 60%. © 1990 IEEE
Pages: 599-609
Page count: 11