Being Bayesian about network structure. A Bayesian approach to structure discovery in Bayesian networks

被引:515
作者
Friedman, N [1 ]
Koller, D
机构
[1] Hebrew Univ Jerusalem, Sch Engn & Comp Sci, IL-91904 Jerusalem, Israel
[2] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
Bayesian networks; structure learning; MCMC; Bayesian model averaging;
D O I
10.1023/A:1020249912095
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many multivariate domains, we are interested in analyzing the dependency structure of the underlying distribution, e.g., whether two variables are in direct interaction. We can represent dependency structures using Bayesian network models. To analyze a given data set, Bayesian model selection attempts to find the most likely (MAP) model, and uses its structure to answer these questions. However, when the amount of available data is modest, there might be many models that have non-negligible posterior. Thus, we want compute the Bayesian posterior of a feature, i.e., the total posterior probability of all models that contain it. In this paper, we propose a new approach for this task. We first show how to efficiently compute a sum over the exponential number of networks that are consistent with a fixed order over network variables. This allows us to compute, for a given order, both the marginal probability of the data and the posterior of a feature. We then use this result as the basis for an algorithm that approximates the Bayesian posterior of a feature. Our approach uses a Markov Chain Monte Carlo (MCMC) method, but over orders rather than over network structures. The space of orders is smaller and more regular than the space of structures, and has much a smoother posterior "landscape". We present empirical results on synthetic and real-life datasets that compare our approach to full model averaging (when possible), to MCMC over network structures, and to a non-Bayesian bootstrap approach.
引用
收藏
页码:95 / 125
页数:31
相关论文
共 29 条
[1]  
[Anonymous], MSRTR9705
[2]  
[Anonymous], 1998, Learning in Graphical Models, chapter A tutorial on learning with Bayesian networks
[3]  
[Anonymous], P 11 C UNC ART INT
[4]  
Beinlich I., 1989, P 2 EUR C AI MED
[5]   A guide to the literature on learning probabilistic networks from data [J].
Buntine, W .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (02) :195-210
[6]  
Buntine W., 1991, P 7 C UNC ART INT, P52
[7]   Rao-Blackwellisation of sampling schemes [J].
Casella, G ;
Robert, CP .
BIOMETRIKA, 1996, 83 (01) :81-94
[8]  
COOPER GF, 1992, MACH LEARN, V9, P309, DOI 10.1007/BF00994110
[9]   Using Bayesian networks to analyze expression data [J].
Friedman, N ;
Linial, M ;
Nachman, I ;
Pe'er, D .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) :601-620
[10]  
Friedman N, 1999, UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, P206