Transfer estimation of evolving class priors in data stream classification

被引:25
作者
Zhang, Zhihao [1 ]
Zhou, Jie [1 ]
机构
[1] Tsinghua Univ, Dept Automat, State Key Lab Intelligent Technol & Syst, Tsinghua Natl Lab Informat Sci & Technol TNList, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Concept drift; Transfer learning; Prior estimation; PRIOR PROBABILITIES;
D O I
10.1016/j.patcog.2010.03.021
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data stream classification is a hot topic in data mining research. The great challenge is that the class priors may evolve along the data sequence. Algorithms have been proposed to estimate the dynamic class priors and adjust the classifier accordingly. However, the existing algorithms do not perform well on prior estimation due to the lack of samples from the target distribution. Sample size has great effects in parameter estimation and small-sample effects greatly contaminate the estimation performance. In this paper, we propose a novel parameter estimation method called transfer estimation. Transfer estimation makes use of samples not only from the target distribution but also from similar distributions. We apply this new estimation method to the existing algorithms and obtain an improved algorithm. Experiments on both synthetic and real data sets show that the improved algorithm outperforms the existing algorithms on both class prior estimation and classification. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3151 / 3161
页数:11
相关论文
共 39 条
  • [1] [Anonymous], 1999, ICML
  • [2] [Anonymous], 2007, Proceedings of the 20th International Conference on Neural Information Processing Systems
  • [3] [Anonymous], 2006, Pattern recognition and machine learning
  • [4] Strategies for learning in class imbalance problems
    Barandela, R
    Sánchez, JS
    García, V
    Rangel, E
    [J]. PATTERN RECOGNITION, 2003, 36 (03) : 849 - 851
  • [5] Batista G. E., 2004, ACM SIGKDD Explor. Newslett., P20, DOI [10.1145/1007730.1007735, DOI 10.1145/1007730.1007735]
  • [6] ON ESTIMATION OF MIXING DISTRIBUTIONS
    BOES, DC
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1966, 37 (01): : 177 - &
  • [7] Csiszar I, 1967, Stud Sci Math Hung, V2, P229
  • [8] Dai W., 2007, P 24 INT C MACH LEAR, P193, DOI [10.1145/1273496.1273521, DOI 10.1145/1273496.1273521]
  • [9] Dai Wenyuan, 2008, NEURIPS, P353
  • [10] ASYMPTOTICALLY EFFICIENT ESTIMATION OF PRIOR PROBABILITIES IN MULTICLASS FINITE MIXTURES
    DATTATREYA, GR
    KANAL, LN
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1991, 37 (03) : 482 - 489