Design and Evaluation of a Real-Time URL Spam Filtering Service

被引:221
作者
Thomas, Kurt [1 ]
Grier, Chris [1 ]
Ma, Justin [1 ]
Paxson, Vern [1 ]
Song, Dawn [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
来源
2011 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP 2011) | 2011年
关键词
D O I
10.1109/SP.2011.25
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
On the heels of the widespread adoption of web services such as social networks and URL shorteners, scams, phishing, and malware have become regular threats. Despite extensive research, email-based spam filtering techniques generally fall short for protecting other web services. To better address this need, we present Monarch, a real-time system that crawls URLs as they are submitted to web services and determines whether the URLs direct to spam. We evaluate the viability of Monarch and the fundamental challenges that arise due to the diversity of web service spam. We show that Monarch can provide accurate, real-time protection, but that the underlying characteristics of spam do not generalize across web services. In particular, we find that spam targeting email qualitatively differs in significant ways from spam campaigns targeting Twitter. We explore the distinctions between email and Twitter spam, including the abuse of public web hosting and redirector services. Finally, we demonstrate Monarch's scalability, showing our system could protect a service such as Twitter-which needs to process 15 million URLs/day-for a bit under $800/day.
引用
收藏
页码:447 / 462
页数:16
相关论文
共 61 条
[1]  
*ADV NETW TECHN CT, 2010, U OR ROUT VIEWS PROJ
[2]  
*AM WEB SERV, 2009, AM EC2 INST TYP
[3]  
Anderson D., 2007, USENIX SECURITY
[4]  
[Anonymous], 2010, WEB C WWW
[5]  
[Anonymous], 2010, P USENIX C LARG SCAL
[6]  
[Anonymous], 2009, P 15 ACM SIGKDD INT
[7]  
[Anonymous], P 15 INT C WORLD WID
[8]  
[Anonymous], 2008, P 1 US WORKSH LARG S
[9]  
[Anonymous], P 14 ACM C COMP COMM
[10]  
[Anonymous], 2010, NY TIMES