Bagging for linear classifiers

被引:113
作者
Skurichina, M
Duin, RPW
机构
[1] Delft Univ Technol, Fac Sci Appl, Dept Appl Phys, Pattern Recognit Grp, NL-2600 GA Delft, Netherlands
[2] Inst Math & Informat, Dept Data Anal, LT-2600 Vilnius, Lithuania
关键词
linear discriminant; generalization error; small sample size; regularization; bagging; instability; bias and variance;
D O I
10.1016/S0031-3203(97)00110-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classifiers built on small training sets are usually biased or unstable. Different techniques exist to construct more stable classifiers. It is not clear which ones are good, and whether they really stabilize the classifier or just improve the performance. In this paper bagging (bootstrapping and aggregating) [L. Breiman, Bagging predictors, Machine Learning J. 24(2), 123-140(1996)] is studied for a number of linear classifiers. A measure for the instability of classifiers is introduced. The influence of regularization and bagging on this instability and the generalization error of linear classifiers is investigated. In a simulation study it is shown that in general bagging is not a stabilizing technique. It is also demonstrated that one can consider the instability of the classifier to predict how useful bagging will be. Finally, it is shown experimentally that bagging might improve the performance of the classifier only for very unstable situations. (C) 1998 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:909 / 930
页数:22
相关论文
共 29 条
[1]  
Aivazian S.A., 1989, APPL STAT CLASSIFICA
[2]  
BARSOV DM, 1985, STAT PROBABILITY EC, P376
[3]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[4]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[5]  
DUIN RPW, 1995, P 9 SCAND C IM AN UP
[6]  
DUIN RPW, 1978, THESIS DELFT U TECHN
[7]  
Efron B., 1994, INTRO BOOTSTRAP, V57, DOI DOI 10.1201/9780429246593
[8]  
Fisher R., 1936, ANN EUGENICS, V7
[9]  
FISHER RA, 1940, ANN EUGEN, V10
[10]   REGULARIZED DISCRIMINANT-ANALYSIS [J].
FRIEDMAN, JH .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1989, 84 (405) :165-175