BAYESIAN BELIEF NETWORKS AS A TOOL FOR STOCHASTIC PARSING

Cited by: 9
Author
LUCKE, H
Affiliation
[1] ATR Interpreting Telecommunications Research Laboratories, 2-2 Hikaridai, Seika-cho, Soraku-gun, Kyoto
Keywords
BAYESIAN NETWORKS; GRAMMAR INFERENCE; STOCHASTIC PARSING;
DOI
10.1016/0167-6393(94)00046-D
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Bayesian Belief Networks are a powerful tool for combining different knowledge sources with various degrees of uncertainty in a mathematically sound and computationally efficient way. Surprisingly, they have not yet found their way into the speech processing field, even though this field depends on multiple unreliable information sources. The present paper shows how the theory can be utilized for language modeling. After providing an introduction to the theory of Bayesian Networks, we develop several extensions to the classic theory by describing mechanisms for dealing with statistical dependence among daughter nodes (usually assumed to be conditionally independent) and by providing a learning algorithm, based on the EM algorithm, with which the probabilities of link matrices can be learned from example data. Using these extensions, a language model for speech recognition based on a context-free framework is constructed. In this model, sentences are not parsed in their entirety, as is usual with grammatical description, but only "locally" on suitably located segments. The model was evaluated on a text database. In terms of test set entropy, the model performed at least as well as the bi/tri-gram models, while showing a good ability to generalize from training to test data.
Pages: 89-118
Page count: 30