A survey of learning-based techniques of email spam filtering

Cited by: 174
Authors
Blanzieri, Enrico [2]
Bryl, Anton [1]
Affiliations
[1] Univ Trent, ICT Int Doctorate Sch, Trento, Italy
[2] Univ Trent, Dept Informat & Commun Technol, Trento, Italy
Keywords
Spam filtering; Machine learning
DOI
10.1007/s10462-009-9109-6
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Email spam is one of the major problems of today's Internet, bringing financial damage to companies and annoying individual users. Among the approaches developed to stop spam, filtering is an important and popular one. In this paper we give an overview of the state of the art of machine learning applications for spam filtering, and of the ways in which different filtering methods are evaluated and compared. We also provide a brief description of other branches of anti-spam protection and discuss the use of various approaches in commercial and non-commercial anti-spam software solutions.
Pages: 63-92
Number of pages: 30