DocuBurst: Visualizing Document Content using Language Structure

被引:69
作者
Collins, Christopher [1 ]
Carpendale, Sheelagh [2 ]
Penn, Gerald [1 ]
机构
[1] Univ Toronto, Toronto, ON, Canada
[2] Univ Calgary, Calgary, AB, Canada
关键词
TREE;
D O I
10.1111/j.1467-8659.2009.01439.x
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Textual data is at the forefront of information management problems today. One response has been the development of visualizations of text data. These visualizations, commonly based on simple attributes such as relative word frequency,, have become increasingly popular tools. We extend this direction, presenting the first visualization of document content which combines word frequency with the human-created structure in lexical databases to create a visualization that also reflects semantic content. DocBurst is a radial, space-filling layout of hyponoymy (the IS-A relation), overlaid with occurrence counts of words in a document of interest to provide visual summaries at varying levels of granularity. Interactive document analysis is supported with geometric and semantic zoom, selectable focus on individual words, and linked access to source text.
引用
收藏
页码:1039 / 1046
页数:8
相关论文
共 29 条
[1]   Categorization and Analysis of Text in Computer Mediated Communication Archives using Visualization [J].
Abbasi, Ahmed ;
Chen, Hsinchun .
PROCEEDINGS OF THE 7TH ACM/IEE JOINT CONFERENCE ON DIGITAL LIBRARIES: BUILDING & SUSTAINING THE DIGITAL ENVIRONMENT, 2007, :11-18
[2]  
Alcock Keith., 2004, WordNet relationship browser
[3]  
[Anonymous], 2005, P SIGCHI C HUM FACT
[4]  
[Anonymous], WORDLE BEAUTIFUL WOR
[5]  
[Anonymous], 2003, FIELD GUIDE DIGITAL
[6]  
[Anonymous], 1986, P SIGCHI C HUMAN FAC, DOI DOI 10.1145/22339.22342
[7]  
[Anonymous], 2000, P 1 N AM CHAPTER ASS
[8]  
[Anonymous], 1998, P IEEE S INF VIS
[9]  
BEDERSON BB, 2000, P ACM C US INT SOFTW, P217
[10]  
Bertin J., 1983, SEMIOLOGY GRAPHICS D