Incremental Visual Text Analytics of News Story Development

被引:4
作者
Krstajic, Milos [1 ]
Najm-Araghi, Mohammad [1 ]
Mansmann, Florian [1 ]
Keim, Daniel A. [1 ]
机构
[1] Univ Konstanz, Constance, Germany
来源
VISUALIZATION AND DATA ANALYSIS 2012 | 2012年 / 8294卷
关键词
News Stream Analysis; Topic Evolution; Dynamic Visualization; Text Analytics;
D O I
10.1117/12.912456
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
080906 [电磁信息功能材料与结构]; 082806 [农业信息与电气工程];
摘要
Online news sources produce thousands of news articles every day, reporting on local and global real-world events. New information quickly replaces the old, making it difficult for readers to put current events in the context of the past. Additionally, the stories have very complex relationships and characteristics that are difficult to model: they can be weakly or strongly connected, or they can merge or split over time. In this paper, we present a visual analytics system for exploration of news topics in dynamic information streams, which combines interactive visualization and text mining techniques to facilitate the analysis of similar topics that split and merge over time. We employ text clustering techniques to automatically extract stories from online news streams and present a visualization that: 1) shows temporal characteristics of stories in different time frames with different level of detail; 2) allows incremental updates of the display without recalculating the visual features of the past data; 3) sorts the stories by minimizing clutter and overlap from edge crossings. By using interaction, stories can be filtered based on their duration and characteristics in order to be explored in full detail with details on demand. To demonstrate the usefulness of our system, case studies with real news data are presented and show the capabilities for detailed dynamic text stream exploration.
引用
收藏
页数:12
相关论文
共 22 条
[1]
Aigner W, 2011, HUM-COMPUT INT-SPRIN, P1, DOI 10.1007/978-0-85729-079-3
[2]
[Anonymous], 2006, DATA STREAM MANAGEME
[3]
[Anonymous], WWW 09
[4]
[Anonymous], 1 INT WORKSH DAT ENG
[5]
[Anonymous], 1998, P BROADC NEWS TRANSC
[6]
[Anonymous], IEEE T VISUALIZATION
[7]
[Anonymous], 2010, Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '10
[8]
Blei D.M., 2006, INT C MACHINE LEARNI, DOI DOI 10.1145/1143844.1143859
[9]
Stacked Graphs - Geometry & Aesthetics [J].
Byron, Lee ;
Wattenberg, Martin .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2008, 14 (06) :1245-1252
[10]
A Visual Backchannel for Large-Scale Events [J].
Doerk, Marian ;
Gruen, Daniel ;
Williamson, Carey ;
Carpendale, Sheelagh .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2010, 16 (06) :1129-1138