Bayesian Network Classifiers

被引:343
作者
Nir Friedman
Dan Geiger
Moises Goldszmidt
机构
[1] University of California,Computer Science Division
[2] Technion,Computer Science Department
[3] SRI International,undefined
来源
Machine Learning | 1997年 / 29卷
关键词
Bayesian networks; classification;
D O I
暂无
中图分类号
学科分类号
摘要
Recent work in supervised learning has shown that a surprisingly simple Bayesian classifier with strong assumptions of independence among features, called naive Bayes, is competitive with state-of-the-art classifiers such as C4.5. This fact raises the question of whether a classifier with less restrictive assumptions can perform even better. In this paper we evaluate approaches for inducing classifiers from data, based on the theory of learning Bayesian networks. These networks are factored representations of probability distributions that generalize the naive Bayesian classifier and explicitly represent statements about independence. Among these approaches we single out a method we call Tree Augmented Naive Bayes (TAN), which outperforms naive Bayes, yet at the same time maintains the computational simplicity (no search involved) and robustness that characterize naive Bayes. We experimentally tested these approaches, using problems from the University of California at Irvine repository, and compared them to C4.5, naive Bayes, and wrapper methods for feature selection.
引用
收藏
页码:131 / 163
页数:32
相关论文
共 24 条
  • [1] Buntine W.(1996)A guide to the literature on learning probabilistic networks from data IEEE Trans. on Knowledge and Data Engineering 8 195-210
  • [2] Chow C. K.(1968)Approximating discrete probability distributions with dependence trees IEEE Trans. on Info. Theory 14 462-467
  • [3] Liu C. N.(1992)A Bayesian method for the induction of probabilistic networks from data Machine Learning 9 309-347
  • [4] Cooper G. F.(1976)Properties of diagnostic data distributions Biometrics 32 647-658
  • [5] Herskovits E.(1997)On bias, variance, 0/1-loss, and the curse-of-dimensionality Data Mining and Knowledge Discovery 1 55-77
  • [6] Dawid A. P.(1996)Knowledge representation and inference in similarity networks and Bayesian multinets Artificial Intelligence 82 45-74
  • [7] Friedman J.(1995)Learning Bayesian networks: The combination of knowledge and statistical data Machine Learning 20 197-243
  • [8] Geiger D.(1951)On information and sufficiency Annals of Mathematical Statistics 22 76-86
  • [9] Heckerman D.(1994)Learning Bayesian belief networks. An approach based on the MDL principle Computational Intelligence 10 269-293
  • [10] Heckerman D.(1995)The EM algorithm for graphical association models with missing data Computational Statistics and Data Analysis 19 191-201