#FluxFlow: Visual Analysis of Anomalous Information Spreading on Social Media

Cited by: 152
Authors
Zhao, Jian [1]
Cao, Nan [2]
Wen, Zhen [2]
Song, Yale [3]
Lin, Yu-Ru [4]
Collins, Christopher [5]
Affiliations
[1] Univ Toronto, Toronto, ON M5S 1A1, Canada
[2] IBM J Watson Res Ctr, Yorktown Hts, NY 10598 USA
[3] MIT, Cambridge, MA 02139 USA
[4] Univ Pittsburgh, Pittsburgh, PA 15260 USA
[5] UOIT, Oshawa, ON, Canada
Keywords
Retweeting threads; anomaly detection; social media; visual analytics; machine learning; information visualization; ANALYTICS;
DOI
10.1109/TVCG.2014.2346922
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081205 [Computer Software];
Abstract
We present FluxFlow, an interactive visual analysis system for revealing and analyzing anomalous information spreading in social media. Every day, millions of messages are created, commented on, and shared by people on social media websites, such as Twitter and Facebook. This provides valuable data for researchers and practitioners in many application domains, such as marketing, to inform decision-making. Distilling valuable social signals from the huge crowd's messages, however, is challenging, due to the heterogeneous and dynamic crowd behaviors. The challenge is rooted in data analysts' capability of discerning anomalous information behaviors, such as the spreading of rumors or misinformation, from the more conventional patterns, such as popular topics and newsworthy events, in a timely fashion. FluxFlow incorporates advanced machine learning algorithms to detect anomalies, and offers a set of novel visualization designs for presenting the detected threads for deeper analysis. We evaluated FluxFlow with real datasets containing the Twitter feeds captured during significant events such as Hurricane Sandy. Through quantitative measurements of the algorithmic performance and qualitative interviews with domain experts, the results show that the back-end anomaly detection model is effective in identifying anomalous retweeting threads, and its front-end interactive visualizations are intuitive and useful for analysts to discover insights in data and comprehend the underlying analytical model.
Pages: 1773-1782
Page count: 10