SYNTACTIC DECISION RULES FOR RECOGNITION OF SPOKEN WORDS AND PHRASES USING A STOCHASTIC AUTOMATON

被引:12
作者
KASHYAP, RL
机构
[1] Department of Electrical Engineering, Purdue University, West Lafayette
关键词
distance between strings; Index Terms-Comparison of symbol strings; most probable word; speech recognition; stochastic automation; syntactic pattern recognition; word recognition;
D O I
10.1109/TPAMI.1979.4766901
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study deals with the design of a syntactic decision rule for recognizing an unknown utterance from a set X. The decision rule is expressed as a function of the character string (CS) derived from the test utterance. To obtain the CS, the waveform of the utterance is divided into a large number of frames of roughly equal duration numbered 1, 2, …, n. The ith symbol in the CS is the phonemic symbol obtained by subjecting the ith frame of the waveform to a relatively simple phoneme decision rule, the number of symbols in the CS being n. All the available nonacoustic information such as the lexicon of words in the set X, the possibility of confusion between different phonemes as seen by the phoneme decision rule, etc. is used in the design of the decision rule. The syntactic decision rule can be implemented by a stochastic finite state automaton involving limited memory and computation. The decision rule can also be interpreted as yielding the phrase x which minimizes a distance measure D(x, z) between the phrase x ∊X and the observed CS z. We will compare this approach with the other approaches such as the Viterbi methods, the distance approaches involving various types of distances, etc. Copyright © 1979 by The Institute of Electrical and Electronics Engineers, Inc.
引用
收藏
页码:154 / 163
页数:10
相关论文
共 13 条
[1]   CONTINUOUS SPEECH RECOGNITION BY STATISTICAL-METHODS [J].
JELINEK, F .
PROCEEDINGS OF THE IEEE, 1976, 64 (04) :532-556
[2]  
KASHYAP RL, 1978, IEEE T COMPUT, V27, P442, DOI 10.1109/TC.1978.1675124
[3]  
KASHYAP RL, 1976, TREE7628 PURD U SCH
[4]   VITERBI ALGORITHM AS AN AID IN TEXT RECOGNITION [J].
NEUHOFF, DL .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1975, 21 (02) :222-226
[5]   METHOD FOR CORRECTION OF GARBLED WORDS BASED ON LEVENSHTEIN METRIC [J].
OKUDA, T ;
TANAKA, E ;
KASAI, T .
IEEE TRANSACTIONS ON COMPUTERS, 1976, 25 (02) :172-178
[6]  
RISEMAN EM, 1974, IEEE T COMPUT, V23, P490
[8]  
Sellers P. H., 1974, Journal of Combinatorial Theory, Series A, V16, P253, DOI 10.1016/0097-3165(74)90050-8
[9]  
SILVERMAN HF, 1975, IEEE T ACOUST SPEECH, V23, P87
[10]   ERRORS IN REGULAR LANGUAGES [J].
THOMASON, MG .
IEEE TRANSACTIONS ON COMPUTERS, 1974, C 23 (06) :597-602