Visualizing non-metric similarities in multiple maps

被引:240
作者
van der Maaten, Laurens [1 ]
Hinton, Geoffrey [2 ]
机构
[1] Delft Univ Technol, Pattern Recognit & Bioinformat Lab, NL-2628 CD Delft, Netherlands
[2] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3G4, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Multidimensional scaling; Embedding; Data visualization; Non-metric similarities; NONLINEAR DIMENSIONALITY REDUCTION; LAPLACIAN EIGENMAPS; REPRESENTATION; FRAMEWORK;
D O I
10.1007/s10994-011-5273-4
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
Techniques for multidimensional scaling visualize objects as points in a low-dimensional metric map. As a result, the visualizations are subject to the fundamental limitations of metric spaces. These limitations prevent multidimensional scaling from faithfully representing non-metric similarity data such as word associations or event co-occurrences. In particular, multidimensional scaling cannot faithfully represent intransitive pairwise similarities in a visualization, and it cannot faithfully visualize "central" objects. In this paper, we present an extension of a recently proposed multidimensional scaling technique called t-SNE. The extension aims to address the problems of traditional multidimensional scaling techniques when these techniques are used to visualize non-metric similarities. The new technique, called multiple maps t-SNE, alleviates these problems by constructing a collection of maps that reveal complementary structure in the similarity data. We apply multiple maps t-SNE to a large data set of word association data and to a data set of NIPS co-authorships, demonstrating its ability to successfully visualize non-metric similarities.
引用
收藏
页码:33 / 55
页数:23
相关论文
共 63 条
[1]
[Anonymous], 2007, ADV NEURAL INFORM PR
[2]
[Anonymous], 2007, ARTIF INTELL
[3]
[Anonymous], 2005, Illuminating the path: The research and development agenda for visual analytics (Tech. Rep.)
[4]
[Anonymous], 2008, P 25 INT C MACH LEAR, DOI DOI 10.1145/1390156.1390177
[5]
[Anonymous], 2004, ADV NEURAL INFORM PR
[6]
[Anonymous], 1952, Psychometrika
[7]
[Anonymous], 2009, Advances in neural information processing systems
[8]
[Anonymous], 2005, The Dissimilarity Representation for Pattern Recognition
[9]
[Anonymous], 2004, P NIPS 04 WORKSH DAT, DOI 10.1.1.84.5833
[10]
[Anonymous], 1998, U S FLORIDA WORD ASS