Averaging over decision trees

被引:5
作者
Oliver, JJ
Hand, D
机构
[1] Department of Statistics, Open University, Walton Hall, Milton Keynes
关键词
decision trees; classification trees; averaging; minimum message length; Bayesian trees;
D O I
10.1007/BF01246103
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Pruning a decision tree is considered by some researchers to be the most important part of tree building in noisy domains. While there are many approaches to pruning, the alternative of averaging over decision trees has not received as much attention. The basic idea of tree averaging is to produce a weighted sum of decisions. We consider the set of trees used for the averaging process, and how weights should be assigned to each tree in this set. We define the concept of a fanned set for a tree, and examine how the Minimum Message Length paradigm of learning may be used to average over decision trees. We perform an empirical evaluation of two averaging approaches, and a Minimum Message Length approach.
引用
收藏
页码:281 / 297
页数:17
相关论文
共 28 条
  • [11] GEORGEFF MP, 1984, P 6 EUR C ART INT EC, P473
  • [12] Hand D J, 1992, Stat Methods Med Res, V1, P49, DOI 10.1177/096228029200100104
  • [13] HASTIE TJ, 1990, SHRINKING TREES
  • [14] KWOK SW, 1990, UNCERTAINTY ARTIFICI, V4, P327
  • [15] McLachlan G. J., 2005, Discriminant analysis and statistical pattern recognition
  • [16] Oliver J. J., 1992, Proceedings of the 5th Australian Joint Conference on Artificial Intelligence. AI '92, P361
  • [17] OLIVER JJ, 1991, P WORKSH 8 EV CHANG
  • [18] OLIVER JJ, 1994, 944 OP U WALT HALL D
  • [19] Quinlan J. R., 1992, Proceedings of the 5th Australian Joint Conference on Artificial Intelligence. AI '92, P343
  • [20] Quinlan J. R., 1986, Machine Learning, V1, P81, DOI 10.1007/BF00116251