Class visualization of high-dimensional data with applications

被引:27
作者
Dhillon, IS
Modha, DS
Spangler, WS
机构
[1] Univ Texas, Dept Comp Sci, Austin, TX 78712 USA
[2] IBM Corp, Almaden Res Ctr, San Jose, CA 95120 USA
关键词
class-preserving projections; classification; class tours; linear projections; multidimensional visualization; similarity graphs;
D O I
10.1016/S0167-9473(02)00144-5
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The problem of visualizing high-dimensional data that has been categorized into various classes is considered. The goal in visualizing is to quickly absorb inter-class and intra-class relationships. Towards this end, class-preserving projections of the multidimensional data onto two-dimensional planes, which can be displayed on a computer screen, are introduced. These class-preserving projections maintain the high-dimensional class structure, and are closely related to Fisher's linear discriminants. By displaying sequences of such two-dimensional projections and by moving continuously from one projection to the next, an illusion of smooth motion through a multidimensional display can be created. Such sequences are called class tours. Furthermore, class-similarity graphs are overlaid on the two-dimensional projections to capture the distance relationships in the original high-dimensional space. The above visualization tools are illustrated on the classical Iris plant data, the ISOLET spoken letter data, and the PENDIGITS on-line handwriting data set. It is shown how the visual examination of the data can uncover latent class relationships. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:59 / 90
页数:32
相关论文
共 37 条
  • [1] Alimoglu F, 1996, THESIS BOGAZICI U IS
  • [2] Alimoglu F., 1996, P 5 TURK ART INT ART
  • [3] [Anonymous], 1979, Multivariate analysis
  • [4] THE GRAND TOUR - A TOOL FOR VIEWING MULTIDIMENSIONAL DATA
    ASIMOV, D
    [J]. SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1985, 6 (01): : 128 - 143
  • [5] ASIMOV D, 1985, P 17 S INT COMP SCI
  • [6] NUMERICAL METHODS FOR COMPUTING ANGLES BETWEEN LINEAR SUBSPACES
    BJORCK, A
    GOLUB, GH
    [J]. MATHEMATICS OF COMPUTATION, 1973, 27 (123) : 579 - 594
  • [7] BLAKE C, 1998, UCI REPOSITRY MACHIN
  • [8] Bryan JG, 1951, HARVARD EDUC REV, V21, P90
  • [9] BUJA A, 1997, DYNAMIC PROJECTIONS
  • [10] A spreadsheet approach to information visualization
    Chi, EH
    Barry, P
    Riedl, J
    Konstan, J
    [J]. IEEE SYMPOSIUM ON INFORMATION VISUALIZATION, PROCEEDINGS, 1997, : 17 - 24