Reinventing the Contingency Wheel: Scalable Visual Analytics of Large Categorical Data

被引:15
作者
Alsallakh, Bilal [1 ]
Aigner, Wolfgang
Miksch, Silvia [1 ]
Groeller, M. Eduard [1 ]
机构
[1] Vienna Univ Technol, Vienna, Austria
基金
奥地利科学基金会;
关键词
Large categorical data; contingency table analysis; information interfaces and representation; visual analytics;
D O I
10.1109/TVCG.2012.254
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Contingency tables summarize the relations between categorical variables and arise in both scientific and business domains. Asymmetrically large two-way contingency tables pose a problem for common visualization methods. The Contingency Wheel has been recently proposed as an interactive visual method to explore and analyze such tables. However, the scalability and readability of this method are limited when dealing with large and dense tables. In this paper we present Contingency Wheel++, new visual analytics methods that overcome these major shortcomings: (1) regarding automated methods, a measure of association based on Pearson's residuals alleviates the bias of the raw residuals originally used, (2) regarding visualization methods, a frequency-based abstraction of the visual elements eliminates overlapping and makes analyzing both positive and negative associations possible, and (3) regarding the interactive exploration environment, a multi-level overview+detail interface enables exploring individual data items that are aggregated in the visualization or in the table using coordinated views. We illustrate the applicability of these new methods with a use case and show how they enable discovering and analyzing nontrivial patterns and associations in large categorical data.
引用
收藏
页码:2849 / 2858
页数:10
相关论文
共 42 条
  • [1] Alsallakh B., 2011, PROC EUROVA, P53
  • [2] [Anonymous], 2002, Principal components analysis
  • [3] [Anonymous], 2005, Illuminating the path: The research and development agenda for visual analytics (Tech. Rep.)
  • [4] Parallel sets: Visual analysis of categorical data
    Bendix, F
    Kosara, R
    Hauser, H
    [J]. INFOVIS 05: IEEE SYMPOSIUM ON INFORMATION VISUALIZATION, PROCEEDINGS, 2005, : 133 - 140
  • [5] Benzecri J.P., 1990, CORRES ANAL HDB
  • [6] KNIME:: The Konstanz Information Miner
    Berthold, Michael R.
    Cebron, Nicolas
    Dill, Fabian
    Gabriel, Thomas R.
    Koetter, Tobias
    Meinl, Thorsten
    Ohl, Peter
    Sieb, Christoph
    Thiel, Kilian
    Wiswedel, Bernd
    [J]. DATA ANALYSIS, MACHINE LEARNING AND APPLICATIONS, 2008, : 319 - 326
  • [7] Bertin J., 1983, SEMIOLOGY GRAPHICS D
  • [8] Dix A., 2002, P WORK C ADV VIS INT, P167, DOI DOI 10.1145/1556262.1556289
  • [9] Interactive information visualization of a million items
    Fekete, JD
    Plaisant, C
    [J]. INFOVIS 2002: IEEE SYMPOSIUM ON INFORMATION VISUALIZATION 2002, 2002, : 117 - 124
  • [10] Friendly M., 1992, SAS User Group International Conference, P190