Class visualization of high-dimensional data with applications

被引:27
作者
Dhillon, IS
Modha, DS
Spangler, WS
机构
[1] Univ Texas, Dept Comp Sci, Austin, TX 78712 USA
[2] IBM Corp, Almaden Res Ctr, San Jose, CA 95120 USA
关键词
class-preserving projections; classification; class tours; linear projections; multidimensional visualization; similarity graphs;
D O I
10.1016/S0167-9473(02)00144-5
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The problem of visualizing high-dimensional data that has been categorized into various classes is considered. The goal in visualizing is to quickly absorb inter-class and intra-class relationships. Towards this end, class-preserving projections of the multidimensional data onto two-dimensional planes, which can be displayed on a computer screen, are introduced. These class-preserving projections maintain the high-dimensional class structure, and are closely related to Fisher's linear discriminants. By displaying sequences of such two-dimensional projections and by moving continuously from one projection to the next, an illusion of smooth motion through a multidimensional display can be created. Such sequences are called class tours. Furthermore, class-similarity graphs are overlaid on the two-dimensional projections to capture the distance relationships in the original high-dimensional space. The above visualization tools are illustrated on the classical Iris plant data, the ISOLET spoken letter data, and the PENDIGITS on-line handwriting data set. It is shown how the visual examination of the data can uncover latent class relationships. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:59 / 90
页数:32
相关论文
共 37 条
  • [21] Gnanadesikan R., 1982, STAT PROBABILITY ESS, P269
  • [22] Golub G.H., 2013, MATRIX COMPUTATIONS
  • [23] Quantization
    Gray, RM
    Neuhoff, DL
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (06) : 2325 - 2383
  • [24] Grinstein G, 1995, VISUALIZATION '95 - PROCEEDINGS, P405
  • [25] Hartigan J. A., 1975, CLUSTERING ALGORITHM
  • [26] PROJECTION PURSUIT
    HUBER, PJ
    [J]. ANNALS OF STATISTICS, 1985, 13 (02) : 435 - 475
  • [27] ANALYZING HIGH-DIMENSIONAL DATA WITH MOTION GRAPHICS
    HURLEY, C
    BUJA, A
    [J]. SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1990, 11 (06): : 1193 - 1211
  • [28] Kohonen T., 1995, SELF ORG MAPS
  • [29] A NONLINEAR PROJECTION METHOD BASED ON KOHONENS TOPOLOGY PRESERVING-MAPS
    KRAAIJVELD, MA
    MAO, JC
    JAIN, AK
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (03): : 548 - 559
  • [30] Kruskal J.B., 1977, Statistical Methods for Digital Computers, P296