Functional trees

被引:203
作者
Gama, J [1 ]
机构
[1] Univ Porto, FEP, LIACC, P-4150 Oporto, Portugal
关键词
multivariate decision trees; multiple models; supervised learning;
D O I
10.1023/B:MACH.0000027782.67192.13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the context of classification problems, algorithms that generate multivariate trees are able to explore multiple representation languages by using decision tests based on a combination of attributes. In the regression setting, model trees algorithms explore multiple representation languages but using linear models at leaf nodes. In this work we study the effects of using combinations of attributes at decision nodes, leaf nodes, or both nodes and leaves in regression and classification tree learning. In order to study the use of functional nodes at different places and for different types of modeling, we introduce a simple unifying framework for multivariate tree learning. This framework combines a univariate decision tree with a linear function by means of constructive induction. Decision trees derived from the framework are able to use decision nodes with multivariate tests, and leaf nodes that make predictions using linear functions. Multivariate decision nodes are built when growing the tree, while functional leaves are built when pruning the tree. We experimentally evaluate a univariate tree, a multivariate tree using linear combinations at inner and leaf nodes, and two simplified versions restricting linear combinations to inner nodes and leaves. The experimental evaluation shows that all functional trees variants exhibit similar performance, with advantages in different datasets. In this study there is a marginal advantage of the full model. These results lead us to study the role of functional leaves and nodes. We use the bias-variance decomposition of the error, cluster analysis, and learning curves as tools for analysis. We observe that in the datasets under study and for classification and regression, the use of multivariate decision nodes has more impact in the bias component of the error, while the use of multivariate decision leaves has more impact in the variance component.
引用
收藏
页码:219 / 250
页数:32
相关论文
共 41 条
  • [1] [Anonymous], 1996, P 2 INT C KNOWLEDGE
  • [2] [Anonymous], 2002, LNCS, DOI DOI 10.1007/3-540-45681-3
  • [3] [Anonymous], INTELLIGENT DATA ANA
  • [4] Bhattacharyya G. K., 1977, STAT CONCEPTS METHOD
  • [5] Blake C., 1999, Uci repository of machine learning data sets
  • [6] Breiman L, 1998, ANN STAT, V26, P801
  • [7] Bagging predictors
    Breiman, L
    [J]. MACHINE LEARNING, 1996, 24 (02) : 123 - 140
  • [8] Breiman L., 1998, CLASSIFICATION REGRE
  • [9] RECURSIVE AUTOMATIC BIAS SELECTION FOR CLASSIFIER CONSTRUCTION
    BRODLEY, CE
    [J]. MACHINE LEARNING, 1995, 20 (1-2) : 63 - 94
  • [10] BRODLEY CE, 1995, MACH LEARN, V19, P45, DOI 10.1007/BF00994660