Detecting phishing web pages with visual similarity assessment based on Earth Mover's Distance (EMD)

被引:159
作者
Fu, Anthony Y. [1 ]
Wenyin, Liu [1 ]
Deng, Xiaotie [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
关键词
antiphishing; visual assessment; Earth Mover's Distance;
D O I
10.1109/TDSC.2006.50
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An effective approach to phishing Web page detection is proposed, which uses Earth Mover's Distance (EMD) to measure Web page visual similarity. We first convert the involved Web pages into low resolution images and then use color and coordinate features to represent the image signatures. We use EMD to calculate the signature distances of the images of the Web pages. We train an EMD threshold vector for classifying a Web page as a phishing or a normal one. Large-scale experiments with 10,281 suspected Web pages are carried out to show high classification precision, phishing recall, and applicable time performance for online enterprise solution. We also compare our method with two others to manifest its advantage. We also built up a real system which is already used online and it has caught many real phishing cases.
引用
收藏
页码:301 / 311
页数:11
相关论文
共 24 条
[1]  
Broder A. Z., 1997, P 6 INT WORLD WID WE, V29, P1157, DOI [DOI 10.1016/S0169-7552(97)00031-7, 10.1016/S0169-7552(97)00031-7]
[2]  
Chen Y., 2003, Proceedings of the WWW'03, P225
[3]   Collection statistics for fast duplicate document detection [J].
Chowdhury, A ;
Frieder, O ;
Grossman, D ;
McCabe, MC .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2002, 20 (02) :171-191
[4]  
Cohen S., 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision, P1076, DOI 10.1109/ICCV.1999.790393
[5]  
Dhamija R., 2005, P S US PRIV SEC
[6]  
Fu AY, 2005, LECT NOTES COMPUT SC, V3806, P618
[7]  
Grauman K, 2004, PROC CVPR IEEE, P220
[8]  
Gu X., 2002, P 2 INT C AD HYP AD, P29
[9]  
Hillier F.S., 1990, INTRO MATH PROGRAMMI
[10]  
Hitchcock F. L., 1941, Journal of Mathematics and Physics, V20, P224, DOI DOI 10.1002/SAPM1941201224