Visualizing distortions and recovering topology in continuous projection techniques

被引:88
作者
Aupetit, Michael [1 ]
机构
[1] CEA, DAM, Dept Anal Surveillance Environm, F-91680 Bruyeres Le Chatel, France
关键词
exploratory data analysis; continuous projection; high-dimensional data; distortion visualization; uncertainty visualization; topology recovering; Delaunay graph; Voronoi cells;
D O I
10.1016/j.neucom.2006.11.018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The visualization of continuous multi-dimensional data based on their projection to a 2-dimensional space is a way to detect visually interesting patterns, as far as the projection provides a faithful image of the original data. In order to evaluate this faithfulness, we propose to visualize any measure associated to a projected datum or to a pair of projected data, by coloring the corresponding Voronoi cell in the projection space. We also define specific measures and show how they allow estimating visually whether some part of the projection is or is not a reliable image of the original manifolds. It also helps to figure out what the original topology of the data is, telling where the high-dimensional. manifolds have been torn or glued during the projection. We experiment these techniques with the principal component analysis and the curvilinear component analysis applied to artificial and real databases. (c) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:1304 / 1330
页数:27
相关论文
共 37 条
[1]  
[Anonymous], 1988, SELF ORG ASS MEMORY
[2]  
[Anonymous], 1952, Psychometrika
[3]  
ARAAR R, 2005, P 2005 IEEE S INF VI, P111
[4]   High-dimensional labeled data analysis with topology representing graphs [J].
Aupetit, M ;
Catz, T .
NEUROCOMPUTING, 2005, 63 :139-169
[5]   The Quickhull algorithm for convex hulls [J].
Barber, CB ;
Dobkin, DP ;
Huhdanpaa, H .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1996, 22 (04) :469-483
[6]  
BEARD MK, 1991, 9126 NCGIA
[7]  
BELLMAN R, 1991, ADAPTIVE CONTROL PRO
[8]   Cluster stability and the use of noise in interpretation of clustering [J].
Davidson, GS ;
Wylie, BN ;
Boyack, KW .
IEEE SYMPOSIUM ON INFORMATION VISUALIZATION 2001, PROCEEDINGS, 2001, :23-30
[9]  
de Silva V., 2004, S POINT BAS GRAPH
[10]   Curvilinear component analysis: A self-organizing neural network for nonlinear mapping of data sets [J].
Demartines, P ;
Herault, J .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (01) :148-154