AN OVERVIEW OF THE SPHINX SPEECH RECOGNITION SYSTEM

Cited by: 166
Authors
LEE, KF
HON, HW
REDDY, R
Affiliation
[1] School of Computer Science, Carnegie Mellon University, Pittsburgh
Source
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING | 1990 / Vol. 38 / No. 01
Funding
National Science Foundation (United States);
Keywords
DOI
10.1109/29.45616
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Speaker independence, continuous speech, and large vocabularies pose three of the greatest challenges in automatic speech recognition. Previously, accurate speech recognizers avoided dealing simultaneously with all three problems. This paper describes SPHINX, a system that demonstrates the feasibility of accurate, large-vocabulary speaker-independent, continuous speech recognition. SPHINX is based on discrete hidden Markov models (HMM's) with LPC-derived parameters. To provide speaker independence, we added knowledge to these HMM's in several ways: multiple codebooks of fixed-width parameters, and an enhanced recognizer with carefully designed models and word duration modeling. To deal with coarticulation in continuous speech, yet still adequately represent a large vocabulary, we introduce two new subword speech units—function-word-dependent phone models and generalized triphone models. With grammars of perplexity 997, 60, and 20, SPHINX attained word accuracies of 71, 94, and 96 percent on a 997-word task. © 1990 IEEE
Pages: 35-45
Number of pages: 11
Related papers
48 in total
[11]  
Duda R. O., 1973, PATTERN CLASSIFICATI, V3
[12]  
Fisher W.M., 1987, 113 M AC SOC AM
[13]   SPEAKER-INDEPENDENT ISOLATED WORD RECOGNITION USING DYNAMIC FEATURES OF SPEECH SPECTRUM [J].
FURUI, S .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (01) :52-59
[14]  
GUPTA VN, 1987, APR P IEEE INT C AC, P697
[15]   CONTINUOUS SPEECH RECOGNITION BY STATISTICAL-METHODS [J].
JELINEK, F .
PROCEEDINGS OF THE IEEE, 1976, 64 (04) :532-556
[16]  
Jelinek F., 1980, Pattern Recognition in Practice. Proceedings of an International Workshop, P381
[17]  
JELINEK F, 1985, MAR P IEEE INT C AC
[18]  
Lee K. F., 1989, AUTOMATIC SPEECH REC
[19]  
LEE KF, 1988, APR P IEE INT C AC S
[20]  
LEE KF, UNPUB IEEE T ACOUST