The Impact of Diversity on Online Ensemble Learning in the Presence of Concept Drift

Cited by: 300
Authors
Minku, Leandro L. [1]
White, Allan P. [2]
Yao, Xin [1]
Affiliations
[1] Univ Birmingham, Sch Comp Sci, CERCIA, Birmingham B15 2TT, W Midlands, England
[2] Univ Birmingham, Sch Math & Stat, Birmingham B15 2TT, W Midlands, England
Keywords
Concept drift; online learning; neural network ensembles; diversity;
DOI
10.1109/TKDE.2009.156
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Online learning algorithms often have to operate in the presence of concept drift (i.e., the concepts to be learned can change with time). This paper presents a new categorization for concept drift, separating drifts according to different criteria into mutually exclusive and nonheterogeneous categories. Moreover, although ensembles of learning machines have been used to learn in the presence of concept drift, there has been no in-depth study of why they help in that setting or of which of their features do or do not contribute. As diversity is one of these features, we present a diversity analysis in the presence of different types of drifts. We show that, before the drift, ensembles with less diversity obtain lower test errors. On the other hand, it is a good strategy to maintain highly diverse ensembles to obtain lower test errors shortly after the drift, regardless of the type of drift, even though high diversity is more important for more severe drifts. Longer after the drift, high diversity becomes less important. Diversity by itself can help to reduce the initial increase in error caused by a drift, but it does not provide faster recovery from drifts in the long term.
Pages: 730-742
Number of pages: 13