Effective classification of noisy data streams with attribute-oriented dynamic classifier selection

被引:25
作者
Zhu, XQ [1 ]
Wu, XD
Yang, Y
机构
[1] Univ Vermont, Dept Comp Sci, Burlington, VT 05405 USA
[2] Monash Univ, Sch Comp Sci & Software Engn, Melbourne, Vic 3004, Australia
关键词
stream data mining; classification; dynamic classifier selection; classifier ensemble; multiple classifier systems; class noise;
D O I
10.1007/s10115-005-0212-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, mining from data streams has become an important and challenging task for many real-world applications such as credit card fraud protection and sensor networking. One popular solution is to separate stream data into chunks, learn a base classifier from each chunk, and then integrate all base classifiers for effective classification. In this paper, we propose a new dynamic classifier selection (DCS) mechanism to integrate base classifiers for effective mining from data streams. The proposed algorithm dynamically selects a single "best" classifier to classify each test instance at run time. Our scheme uses statistical information from attribute values, and uses each attribute to partition the evaluation set into disjoint subsets, followed by a procedure that evaluates the classification accuracy of each base classifier on these subsets. Given a test instance, its attribute values determine the subsets that the similar instances in the evaluation set have constructed, and the classifier with the highest classification accuracy on those subsets is selected to classify the test instance. Experimental results and comparative studies demonstrate the efficiency and efficacy of our method. Such a DCS scheme appears to be promising in mining data streams with dramatic concept drifting or with a significant amount of noise, where the base classifiers are likely conflictive or have low confidence.
引用
收藏
页码:339 / 363
页数:25
相关论文
共 38 条
[1]  
ALI K, 1996, MACHINE LEARNING
[2]  
BALKE C, 1998, UCI DATA REPOSITORY
[3]  
BREIMAN L, 1984, GLASSIFICATION REGRE
[4]  
Breiman L, 1996, MACH LEARN, P24
[5]   RECURSIVE AUTOMATIC BIAS SELECTION FOR CLASSIFIER CONSTRUCTION [J].
BRODLEY, CE .
MACHINE LEARNING, 1995, 20 (1-2) :63-94
[6]  
BRODLEY CE, 1996, P INT GEOSC REM SENS, V1, P532
[7]  
CHAN PKW, 1996, THESIS COLUMBIA U
[8]  
CHING JY, 1995, IEEE T PAMI, V17
[9]  
DOMINGOS P, 2000, P SIGKDD
[10]  
Dougherty J., 1995, MACHINE LEARNING P 1, P194, DOI DOI 10.1016/B978-1-55860-377-6.50032-3