Think globally, fit locally: Unsupervised learning of low dimensional manifolds

被引:699
作者
Saul, LK
Roweis, ST
机构
[1] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
[2] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3G4, Canada
关键词
D O I
10.1162/153244304322972667
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of dimensionality reduction arises in many fields of information processing, including machine learning, data compression, scientific visualization, pattern recognition, and neural computation. Here we describe locally linear embedding (LLE), an unsupervised learning algorithm that computes low dimensional, neighborhood preserving embeddings of high dimensional data. The data, assumed to be sampled from an underlying manifold, are mapped into a single global coordinate system of lower dimensionality. The mapping is derived from the symmetries of locally linear reconstructions, and the actual computation of the embedding reduces to a sparse eigenvalue problem. Notably, the optimizations in LLE-though capable of generating highly nonlinear embeddings-are simple to implement, and they do not involve local minima. In this paper, we describe the implementation of the algorithm in detail and discuss several extensions that enhance its performance. We present results of the algorithm applied to data sampled from known manifolds, as well as to collections of images of faces, lips, and handwritten digits. These examples are used to provide extensive illustrations of the algorithm's performance-both successes and failures-and to relate the algorithm to previous and ongoing work in nonlinear dimensionality reduction.
引用
收藏
页码:119 / 155
页数:37
相关论文
共 80 条
[71]  
TEH YW, 2003, ADV NEURAL INFORMATI, V15
[72]   A global geometric framework for nonlinear dimensionality reduction [J].
Tenenbaum, JB ;
de Silva, V ;
Langford, JC .
SCIENCE, 2000, 290 (5500) :2319-+
[73]  
Tenenbaum JB, 1998, ADV NEUR IN, V10, P682
[74]   Mixtures of probabilistic principal component analyzers [J].
Tipping, ME ;
Bishop, CM .
NEURAL COMPUTATION, 1999, 11 (02) :443-482
[75]   A k-segments algorithm for finding principal curves [J].
Verbeek, JJ ;
Vlassis, N ;
Kröse, B .
PATTERN RECOGNITION LETTERS, 2002, 23 (08) :1009-1017
[76]  
VERBEEK JJ, 2002, IASUVA0201 U AMST CO
[77]   Supervised dimension reduction of intrinsically low-dimensional data [J].
Vlassis, N ;
Motomura, Y ;
Kröse, B .
NEURAL COMPUTATION, 2002, 14 (01) :191-215
[78]  
Weiss Y., 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision, P975, DOI 10.1109/ICCV.1999.790354
[79]  
Williams CKI, 2001, ADV NEUR IN, V13, P675
[80]  
Yu SX, 2001, LECT NOTES COMPUT SC, V2134, P283