Exponentially weighted moving average charts for detecting concept drift

被引:263
作者
Ross, Gordon J. [1 ]
Adams, Niall M. [1 ]
Tasoulis, Dimitris K. [1 ]
Hand, David J. [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Math, London SW7 2AZ, England
关键词
Streaming classification; Concept drift; Change detection;
D O I
10.1016/j.patrec.2011.08.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classifying streaming data requires the development of methods which are computationally efficient and able to cope with changes in the underlying distribution of the stream, a phenomenon known in the literature as concept drift. We propose a new method for detecting concept drift which uses an exponentially weighted moving average (EWMA) chart to monitor the misclassification rate of an streaming classifier. Our approach is modular and can hence be run in parallel with any underlying classifier to provide an additional layer of concept drift detection. Moreover our method is computationally efficient with overhead O(1) and works in a fully online manner with no need to store data points in memory. Unlike many existing approaches to concept drift detection, our method allows the rate of false positive detections to be controlled and kept constant over time. (C) 2011 Published by Elsevier B.V.
引用
收藏
页码:191 / 198
页数:8
相关论文
共 24 条
[1]  
[Anonymous], 2000, ICML, DOI DOI 10.1007/978-3-540-44871-6_130
[2]   Paired Learners for Concept Drift [J].
Bach, Stephen H. ;
Maloof, Marcus A. .
ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, :23-32
[3]  
Baena-Garcia M., 2006, LECT NOTES ARTIF INT, V3171, P286
[4]  
Basseville M, 1993, DETECTION ABRUPT CHA
[5]  
Bell C, 1994, INST MATH S, V23, P7, DOI 10.1214/lnms/1215463111
[6]   Approximate statistical tests for comparing supervised classification learning algorithms [J].
Dietterich, TG .
NEURAL COMPUTATION, 1998, 10 (07) :1895-1923
[7]   A general framework for mining massive data streams [J].
Domingos, P ;
Hulten, G .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2003, 12 (04) :945-949
[8]  
Gama J, 2004, LECT NOTES ARTIF INT, V3171, P286
[9]   STATISTICAL AND STRUCTURAL APPROACHES TO TEXTURE [J].
HARALICK, RM .
PROCEEDINGS OF THE IEEE, 1979, 67 (05) :786-804
[10]  
Harries M., 1999, SPLICE 2 COMP EVALUA