An Assessment of Case-Based Reasoning for Spam Filtering

被引:8
作者
Sarah Jane Delany
Pádraig Cunningham
Lorcan Coyle
机构
[1] Dublin Institute of Technology,Trinity College
[2] University of Dublin,undefined
[3] University College Dublin,undefined
来源
Artificial Intelligence Review | 2005年 / 24卷
关键词
case base reasoning; spam filtering;
D O I
暂无
中图分类号
学科分类号
摘要
Because of the changing nature of spam, a spam filtering system that uses machine learning will need to be dynamic. This suggests that a case-based (memory-based) approach may work well. Case-Based Reasoning (CBR) is a lazy approach to machine learning where induction is delayed to run time. This means that the case base can be updated continuously and new training data is immediately available to the induction process. In this paper we present a detailed description of such a system called ECUE and evaluate design decisions concerning the case representation. We compare its performance with an alternative system that uses Naïve Bayes. We find that there is little to choose between the two alternatives in cross-validation tests on data sets. However, ECUE does appear to have some advantages in tracking concept drift over time.
引用
收藏
页码:359 / 378
页数:19
相关论文
共 13 条
  • [1] Bradley A.(1997)The Use of the Area Under the ROC Curve in the Evaluation of Machine Learning Algorithms Pattern Recognition 30 1145-1150
  • [2] Brighton H.(2002)Advances in Instance Selection for Instance-Based Learning Algorithms Data Mining and Knowledge Discovery 62 153-172
  • [3] Mellish C.(1998)Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms Neural Computing 10 1895-1923
  • [4] Dietterich D.T.(1999)Support Vector Machines for Spam Categorisation IEEE Transactions on Neural Networks 10 1048-1055
  • [5] Drucker H(2003)A Memory-Based Approach to Anti-Spam Filtering for Mailing Lists Information Retrieval 6 49-73
  • [6] Wu D.(2000)undefined United States Patent 6 130-undefined
  • [7] Vapnik V.(undefined)undefined undefined undefined undefined-undefined
  • [8] Sakkis G.(undefined)undefined undefined undefined undefined-undefined
  • [9] Androutsopoulos I(undefined)undefined undefined undefined undefined-undefined
  • [10] Paliouras G(undefined)undefined undefined undefined undefined-undefined