CONTEXT-DEPENDENT PHONETIC HIDDEN MARKOV-MODELS FOR SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION

被引：120

作者：

LEE, KF

机构：

[1] School of Computer Science, Carnegie-Mellon University, Pittsburgh

来源：

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING | 1990年 / 38卷 / 04期

基金：

美国国家科学基金会;

关键词：

D O I：

10.1109/29.52701

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The effectiveness of context-dependent phone modeling for speaker-dependent continuous speech recognition has recently been demonstrated. In this study, we apply context-dependent phone models to speaker-independent continuous speech recognition, and show that they are equally effective in this domain. In addition to evaluating several previously proposed context-dependent models, we also introduce two new context-dependent phonetic units: 1) function-word-dependent phone models, which focus on the most difficult subvocabulary, and 2) generalized triphones, which combine similar triphones together based on an information-theoretic measure. The subword clustering procedure used for generalized triphones can find the optimal number of models given a fixed amount of training data. We demonstrate that context-dependent modeling reduces the error rate by as much as 60%. © 1990 IEEE

引用

页码：599 / 609

页数：11

共 41 条

[1] A MAXIMUM-LIKELIHOOD APPROACH TO CONTINUOUS SPEECH RECOGNITION [J].

BAHL, LR ;

JELINEK, F ;

MERCER, RL .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1983, 5 (02) :179-190

[2]

BAHL LR, 1988, APR IEEE INT C AC SP

[3]

BAHL LR, 1980, APR IEEE INT C AC SP

[4] DRAGON SYSTEM - OVERVIEW [J].

BAKER, JK .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1975, AS23 (01) :24-29

[5]

Baum L. E., 1972, INEQUALITIES, V3, P1

[6]

BROWN PF, 1987, THESIS CARNEGIE MELL

[7]

CHEN F, 1988, JUN IEEE WORKSH SPEE

[8]

CHOW YL, 1986, APR IEEE INT C AC SP

[9]

CRAVERO M, 1986, APR IEEE INT C AC SP

[10]

DENG L, 1988, APR P IEEE INT C AC, P509

← 1 2 3 4 5 →