Image spam filtering using visual information

被引:22
作者
Biggio, Battista [1 ]
Fumera, Giorgio [1 ]
Pillai, Ignazio [1 ]
Roli, Fabio [1 ]
机构
[1] Univ Cagliari, Dept Elect & Elect Engn, Piazza Armi, I-09123 Cagliari, Italy
来源
14TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING, PROCEEDINGS | 2007年
关键词
D O I
10.1109/ICIAP.2007.4362765
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We address the problem of recognizing the so-called image spam, which consists in embedding the spam message into attached images to defeat techniques based on the analysis of e-mails' body text, and in using content obscuring techniques to defeat OCR tools. We propose an approach to recognize image spam based on detecting the presence of content obscuring techniques, and describe a possible implementation based on two low-level image features aimed at detecting obscuring techniques whose consequence is to compromise the OCR effectiveness resulting in character breaking or merging, or in the presence of noise interfering with characters in the binarized image. A preliminary experimental investigation of this approach is reported on a personal data set of spam images.
引用
收藏
页码:105 / +
页数:2
相关论文
共 12 条
[1]  
ANDROUTSOPOULOS A, 2000, P ACM INT C RES DEV, P160
[2]  
[Anonymous], WS9805 AAAI
[3]  
[Anonymous], 2002, A plan for spam
[4]   Image analysis for efficient categorization of image-based spam e-mail [J].
Aradhye, HB ;
Myers, GK ;
Herson, JA .
EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, :914-918
[5]  
BAIRD HS, 2003, P IS T SPIE DOC REC
[6]  
Blando L. R., 1995, Proceedings of the Third International Conference on Document Analysis and Recognition, P319, DOI 10.1109/ICDAR.1995.599003
[7]   Support vector machines for spam categorization [J].
Drucker, H ;
Wu, DH ;
Vapnik, VN .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05) :1048-1054
[8]  
Fumera G, 2006, J MACH LEARN RES, V7, P2699
[9]   Text information extraction in images and video: a survey [J].
Jung, K ;
Kim, KI ;
Jain, AK .
PATTERN RECOGNITION, 2004, 37 (05) :977-997
[10]  
Mori G, 2003, PROC CVPR IEEE, P134