Bagging, subagging and Bragging for improving some prediction algorithms

被引:44
作者
Bühlmann, P [1 ]
机构
[1] ETH, Seminar Stat, CH-8092 Zurich, Switzerland
来源
RECENT ADVANCES AND TRENDS IN NONPARAMETRIC STATISTICS | 2003年
关键词
D O I
10.1016/B978-044451378-6/50002-8
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Bagging (bootstrap aggregating), proposed by Breiman [1], is a method to improve the predictive power of some special estimators or algorithms such as regression or classification trees. First, we review a recently developed theory explaining why bagging decision trees, or also the subagging (subsample aggregating) variant, yields smooth decisions, reducing the variance and mean squared error. We then propose bragging (bootstrap robust aggregating) as a new version of bagging which, in contrast to bagging, is empirically demonstrated to improve also the MARS algorithm which itself already yields continuous function estimates. Finally, bagging is demonstrated as a "module" in conjunction with boosting for an example about tumor classification using microarray gone expressions.
引用
收藏
页码:19 / 34
页数:16
相关论文
共 12 条
[1]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[2]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946
[3]  
Bühlmann P, 2000, ANN STAT, V28, P377
[4]  
Bühlmann P, 2002, ANN STAT, V30, P927
[5]  
BUJA A, 2002, OBSERVATIONS BAGGING
[6]  
CHEN SX, 2003, IN PRESS STAT SINICA
[7]  
DETTLING M, 2003, UNPUB BAGBOOSTING TU
[8]  
Freund Y, 1996, Experiments with a new boosting algorithm. In proceedings 13th Int Conf Mach learn. Pp.148-156, P45
[9]   Additive logistic regression: A statistical view of boosting - Rejoinder [J].
Friedman, J ;
Hastie, T ;
Tibshirani, R .
ANNALS OF STATISTICS, 2000, 28 (02) :400-407
[10]   Stochastic gradient boosting [J].
Friedman, JH .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 38 (04) :367-378