Weighted finite-state transducers in speech recognition

被引:550
作者
Mohri, M
Pereira, F
Riley, M
机构
[1] AT&T Labs Res, Florham Pk, NJ 07932 USA
[2] Univ Penn, Comp & Informat Sci Dept, Philadelphia, PA 19104 USA
关键词
D O I
10.1006/csla.2001.0184
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We survey the use of weighted finite-state transducers (WFSTs) in speech recognition. We show that WFSTs provide a common and natural representation for hidden Markov models (HMMs), context-dependency, pronunciation dictionaries, grammars, and alternative recognition outputs. Furthermore, general transducer operations combine these representations flexibly and efficiently. Weighted determinization and minimization algorithms optimize their time and space requirements, and a weight pushing algorithm distributes the weights along the paths of a weighted transducer optimally for speech recognition. As an example, we describe a North American Business News (NAB) recognition system built using these techniques that combines the HMMs, full cross-word triphones, a lexicon of 40 000 words, and a large trigram grammar into a single weighted transducer that is only somewhat larger than the trigram word grammar and that runs NAB in real-time on a very simple decoder. In another example, we show that the same techniques can be used to optimize lattices for second-pass recognition. In a third example, we show how general automata operations can be used to assemble lattices from different recognizers to improve recognition performance. (C) 2002 Academic Press.
引用
收藏
页码:69 / 88
页数:20
相关论文
共 40 条
[1]  
Aho A.V., 1974, The Design and Analysis of Computer Algorithms
[2]  
Aho Alfred V., 1986, ADDISON WESLEY SERIE
[3]  
[Anonymous], 1978, AUTOMATA THEORETIC A
[4]  
[Anonymous], 1996, 34 ANN M ASS COMPUTA
[5]  
Berstel J., 1988, RATIONAL SERIES THEI
[6]  
BEUTNAGEL M, 1999, P EUR, V2, P607
[7]  
Carlyl J. W., 1971, Journal of Computer and System Sciences, V5, P26, DOI 10.1016/S0022-0000(71)80005-3
[8]  
CORMEN TH, 1992, INTRO ALGORITHMS
[9]  
Crochemore M., 1994, TEXT ALGORITHMS
[10]  
CULIK K, 1997, HDB FORMAL LANGUAGES, V3, P599