A SURVEY OF TECHNIQUES FOR EVENT DETECTION IN TWITTER

Cited by: 441
Authors
Atefeh, Farzindar [1]
Khreich, Wael [1]
Affiliations
[1] NLP Technol Inc, Montreal, PQ, Canada
Keywords
event detection; event identification; microblogs; monitoring social media; Twitter data stream; INFORMATION; TRACKING; MICROBLOG;
DOI
10.1111/coin.12017
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Twitter is among the fastest-growing microblogging and online social networking services. Messages posted on Twitter (tweets) have been reporting everything from daily life stories to the latest local and global news and events. Monitoring and analyzing this rich and continuous user-generated content can yield unprecedentedly valuable information, enabling users and organizations to acquire actionable knowledge. This article provides a survey of techniques for event detection from Twitter streams. These techniques aim at finding real-world occurrences that unfold over space and time. In contrast to conventional media, event detection from Twitter streams poses new challenges. Twitter streams contain large amounts of meaningless messages and polluted content, which negatively affect the detection performance. In addition, traditional text mining techniques are not suitable, because of the short length of tweets, the large number of spelling and grammatical errors, and the frequent use of informal and mixed language. Event detection techniques presented in literature address these issues by adapting techniques from various fields to the uniqueness of Twitter. This article classifies these techniques according to the event type, detection task, and detection method and discusses commonly used features. Finally, it highlights the need for public benchmarks to evaluate the performance of different detection approaches and various features.
Pages: 132-164
Page count: 33