Hierarchical search for large-vocabulary conversational speech recognition - Working toward a solution to the decoding problem

被引:21
作者
Deshmukh, N [1 ]
Ganapatkiraju, A
Picone, J
机构
[1] Mississippi State Univ, Dept Elect & Comp Engn, Core Speech Technol Team, Mississippi State, MS 39762 USA
[2] Mississippi State Univ, Inst Signal & Informat Proc, Mississippi State, MS 39762 USA
关键词
D O I
10.1109/79.790985
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The aim of a continuous speech-recognition system is to provide an efficient and accurate mechanism to transcribe human speech into text. To make this system ubiquitous, it is important that the system must be able to handle a large vocabulary, and be independent of speaker and language characteristics such as accents, speaking styles, dysfluencies, syntax, and grammar. The problems of search for large vocabulary continuous speech recognition (LVCSR) systems are introduced. A typical implementation of a search engine is discussed and the efficacy if this approach on a range of problems is demonstrated.
引用
收藏
页码:84 / 107
页数:24
相关论文
共 69 条
[1]  
ALLEVA F, 1992, P DARPA SPEECH NAT L, P393
[2]   EFFECTIVENESS OF LINEAR PREDICTION CHARACTERISTICS OF SPEECH WAVE FOR AUTOMATIC SPEAKER IDENTIFICATION AND VERIFICATION [J].
ATAL, BS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (06) :1304-1312
[3]  
Bahl L., 1991, P DARPA SPEECH NAT L, P264
[4]   A TREE-BASED STATISTICAL LANGUAGE MODEL FOR NATURAL-LANGUAGE SPEECH RECOGNITION [J].
BAHL, LR ;
BROWN, PF ;
DESOUZA, PV ;
MERCER, RL .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (07) :1001-1008
[5]  
BAHL RL, 1989, P IEEE INT C AC SPEE, P465
[6]  
Baumgarte J., 1972, Computer Methods in Applied Mechanics and Engineering, V1, P1, DOI 10.1016/0045-7825(72)90018-7
[7]  
BROWNING S, 1993, 4666 DRA
[8]  
Chen H P, 1994, Bioorg Med Chem, V2, P1, DOI 10.1016/S0968-0896(00)82195-1
[9]  
Chollet G. F., 1982, Proceedings of ICASSP 82. IEEE International Conference on Acoustics, Speech and Signal Processing, P2026
[10]  
Chow Y. L., 1987, Proceedings: ICASSP 87. 1987 International Conference on Acoustics, Speech, and Signal Processing (Cat. No.87CH2396-0), P89