Adaptive Probabilistic Networks with Hidden Variables

Cited by: 16
Authors
John Binder
Daphne Koller
Stuart Russell
Keiji Kanazawa
Affiliations
[1] University of California, Computer Science Division
[2] Stanford University, Computer Science Department
[3] University of California, Computer Science Division
[4] Microsoft Corporation
Source
Machine Learning | 1997 / Vol. 29
Keywords
Bayesian networks; gradient descent; prior knowledge; dynamic networks; hybrid networks
DOI
Not available
Abstract
Probabilistic networks (also known as Bayesian belief networks) allow a compact description of complex stochastic relationships among several random variables. They are used widely for uncertain reasoning in artificial intelligence. In this paper, we investigate the problem of learning probabilistic networks with known structure and hidden variables. This is an important problem, because structure is much easier to elicit from experts than numbers, and the world is rarely fully observable. We present a gradient-based algorithm and show that the gradient can be computed locally, using information that is available as a byproduct of standard inference algorithms for probabilistic networks. Our experimental results demonstrate that using prior knowledge about the structure, even with hidden variables, can significantly improve the learning rate of probabilistic networks. We extend the method to networks in which the conditional probability tables are described using a small number of parameters. Examples include noisy-OR nodes and dynamic probabilistic networks. We show how this additional structure can be exploited by our algorithm to speed up the learning even further. We also outline an extension to hybrid networks, in which some of the nodes take on values in a continuous domain.
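The claim that the gradient "can be computed locally" can be illustrated on a toy case. The sketch below is not taken from the paper: it assumes a hypothetical two-node network (hidden binary H, observed binary X) and uses the commonly stated form of the local gradient, in which the derivative of the log-likelihood with respect to a conditional probability table entry is the sum over cases of the posterior probability of that node/parent configuration, divided by the entry. All names (`prior`, `cpt`, `local_gradient`) are illustrative, and the sketch treats table entries as free parameters, omitting the renormalization a real learner needs to keep each row summing to one.

```python
import math

# Hypothetical toy network (an assumption, not from the paper):
# hidden binary H -> observed binary X.
# prior[h] = P(H=h); cpt[h][x] = P(X=x | H=h).

def likelihood(prior, cpt, x):
    """P(X=x), marginalizing out the hidden variable H."""
    return sum(prior[h] * cpt[h][x] for h in range(2))

def log_likelihood(prior, cpt, data):
    """Log-likelihood of independent observations of X."""
    return sum(math.log(likelihood(prior, cpt, x)) for x in data)

def local_gradient(prior, cpt, data):
    """Gradient of the log-likelihood w.r.t. each cpt[h][x].

    For each case, the posterior P(H=h | X=x) is exactly what an
    inference routine would return; dividing it by cpt[h][x] gives
    that case's contribution to the gradient entry.
    """
    grad = [[0.0, 0.0], [0.0, 0.0]]
    for x in data:
        px = likelihood(prior, cpt, x)
        for h in range(2):
            posterior = prior[h] * cpt[h][x] / px  # P(H=h | X=x)
            grad[h][x] += posterior / cpt[h][x]
    return grad

if __name__ == "__main__":
    prior = [0.3, 0.7]
    cpt = [[0.9, 0.1], [0.2, 0.8]]
    data = [0, 1, 1, 0, 1]
    for row in local_gradient(prior, cpt, data):
        print(row)
```

Each gradient entry depends only on posteriors over a node and its parents, which is why no separate global computation is needed once inference has been run on every case.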
Pages: 213-244
Page count: 31