Detecting Automation of Twitter Accounts: Are You a Human, Bot, or Cyborg?

Cited by: 372
Authors
Chu, Zi [1]
Gianvecchio, Steven [2]
Wang, Haining [3]
Jajodia, Sushil [4]
Affiliations
[1] Twitter Inc, San Francisco, CA 94103 USA
[2] Mitre Corp, McLean, VA 22102 USA
[3] Coll William & Mary, Dept Comp Sci, Williamsburg, VA 23185 USA
[4] George Mason Univ, Ctr Secure Informat Syst, Fairfax, VA 22030 USA
Funding
U.S. National Science Foundation
Keywords
Automatic identification; bot; cyborg; Twitter; social networks
DOI
10.1109/TDSC.2012.75
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Twitter is a new web application that plays the dual roles of online social network and microblogging service. Users communicate with each other by publishing text-based posts. The popularity and open structure of Twitter have attracted a large number of automated programs, known as bots, which are a double-edged sword for Twitter. Legitimate bots generate a large volume of benign tweets that deliver news and update feeds, while malicious bots spread spam or malicious content. More interestingly, between human and bot there has emerged the cyborg, which refers to either a bot-assisted human or a human-assisted bot. To help human users identify who they are interacting with, this paper focuses on classifying Twitter accounts as human, bot, or cyborg. We first conduct a set of large-scale measurements on a collection of over 500,000 accounts. We observe differences among humans, bots, and cyborgs in terms of tweeting behavior, tweet content, and account properties. Based on the measurement results, we propose a classification system with four parts: 1) an entropy-based component, 2) a spam detection component, 3) an account properties component, and 4) a decision maker. It uses a combination of features extracted from an unknown user to determine the likelihood of that user being a human, bot, or cyborg. Our experimental evaluation demonstrates the efficacy of the proposed classification system.
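As a rough illustration of the intuition behind an entropy-based component, the sketch below computes the Shannon entropy of the gaps between consecutive posts. It rests on several assumptions that do not come from this record: the function name `interval_entropy`, the 60-second bin width, and the use of plain Shannon entropy over binned gaps are all hypothetical choices, and the paper's actual estimator and features may differ. The idea is only that highly regular posting (as a scheduled program might produce) yields low entropy, while irregular posting yields higher entropy.

```python
import math
from collections import Counter


def interval_entropy(timestamps, bin_seconds=60):
    """Shannon entropy (in bits) of binned gaps between consecutive posts.

    Hypothetical helper for illustration only; `bin_seconds` is an
    assumed discretization, not a value taken from the paper.
    """
    ordered = sorted(timestamps)
    # Gaps between each pair of consecutive posting times, in seconds.
    gaps = [later - earlier for earlier, later in zip(ordered, ordered[1:])]
    if not gaps:
        return 0.0
    # Discretize each gap into a bin index, then count bin frequencies.
    counts = Counter(int(gap // bin_seconds) for gap in gaps)
    total = len(gaps)
    # H = sum over bins of p * log2(1 / p).
    return sum((c / total) * math.log2(total / c) for c in counts.values())


# Perfectly periodic posting: every gap lands in the same bin.
periodic = [0, 600, 1200, 1800, 2400]
# Irregular posting: every gap lands in a different bin.
irregular = [0, 60, 660, 4260, 4380]

print(interval_entropy(periodic))
print(interval_entropy(irregular))
```

A score like this would be only one input among several; in the system described above, a decision maker combines it with spam-detection and account-property features.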
Pages: 811-824
Page count: 14