Parallelizing AdaBoost by weights dynamics

被引:27
作者
Merler, Stefano [1 ]
Caprile, Bruno [1 ]
Furlanello, Cesare [1 ]
机构
[1] IRST, ITC, Ctr Ric Sci & Tecnol, I-38050 Povo, Trento, Italy
关键词
AdaBoost; parallelization; weights dynamics; boosting; classification;
D O I
10.1016/j.csda.2006.09.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
AdaBoost is one of the most popular classification methods. In contrast to other ensemble methods (e.g., Bagging) the AdaBoost is inherently sequential. In many data intensive real-world situations this may limit the practical applicability of the method. P-AdaBoost is a novel scheme for the parallelization of AdaBoost, which builds upon earlier results concerning the dynamics of AdaBoost weights. P-AdaBoost yields approximations to the standard AdaBoost models that can be easily and efficiently distributed over a network of computing nodes. Properties of P-AdaBoost as a stochastic minimizer of the AdaBoost cost functional are discussed. Experiments are reported on both synthetic and benchmark data sets. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:2487 / 2498
页数:12
相关论文
共 23 条
[1]   An empirical comparison of voting classification algorithms: Bagging, boosting, and variants [J].
Bauer, E ;
Kohavi, R .
MACHINE LEARNING, 1999, 36 (1-2) :105-139
[2]  
Blake C.L., 1998, UCI repository of machine learning databases
[3]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[4]  
Caprile B, 2002, LECT NOTES COMPUT SC, V2364, P72
[5]   Logistic regression, AdaBoost and Bregman distances [J].
Collins, M ;
Schapire, RE ;
Singer, Y .
MACHINE LEARNING, 2002, 48 (1-3) :253-285
[6]   An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization [J].
Dietterich, TG .
MACHINE LEARNING, 2000, 40 (02) :139-157
[7]  
Efron B., 1993, MONOGR STAT APPL PRO, V57
[8]   A decision-theoretic generalization of on-line learning and an application to boosting [J].
Freund, Y ;
Schapire, RE .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) :119-139
[9]   Additive logistic regression: A statistical view of boosting - Rejoinder [J].
Friedman, J ;
Hastie, T ;
Tibshirani, R .
ANNALS OF STATISTICS, 2000, 28 (02) :400-407
[10]  
KONTOGHIORGHES EJ, 1999, J COMPUTATIONAL STAT, V31