Learning eigenfunctions links spectral embedding and kernel PCA

被引:145
作者
Bengio, Y [1 ]
Delalleau, O [1 ]
Le Roux, N [1 ]
Paiement, JF [1 ]
Vincent, P [1 ]
Ouimet, M [1 ]
机构
[1] Univ Montreal, Ctr Rech Math, Dept Informat & Rech Operat, Montreal, PQ H3C 3J7, Canada
关键词
D O I
10.1162/0899766041732396
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this letter, we show a direct relation between spectral embedding methods and kernel principal components analysis and how both are special cases of a more general learning problem: learning the principal eigenfunctions of an operator defined from a kernel and the unknown data-generating density. Whereas spectral embedding methods provided only coordinates for the training points, the analysis justifies a simple extension to out-of-sample examples (the Nystrom formula) for multidimensional scaling (MDS), spectral clustering, Laplacian eigenmaps, locally linear embedding (LLE), and Isomap. The analysis provides, for all such spectral embedding methods, the definition of a loss function, whose empirical average is minimized by the traditional algorithms. The asymptotic expected value of that loss defines a generalization performance and clarifies what these algorithms are trying to learn. Experiments with LLE, Isomap, spectral clustering, and MDS show that this out-of-sample embedding formula generalizes well, with a level of error comparable to the effect of small perturbations of the training set on the embedding.
引用
收藏
页码:2197 / 2219
页数:23
相关论文
共 28 条
  • [1] [Anonymous], P 17 INT C MACH LEAR
  • [2] [Anonymous], 2002, J MACH LEARN RES
  • [3] [Anonymous], 200308 STANF U DEP S
  • [4] BAKER CTH, 1977, NUMERICAL TREATMENT
  • [5] Laplacian eigenmaps for dimensionality reduction and data representation
    Belkin, M
    Niyogi, P
    [J]. NEURAL COMPUTATION, 2003, 15 (06) : 1373 - 1396
  • [6] BENGIO Y, 2004, ADV NEURAL INFORMATI, V16
  • [7] Chung F, 1997, C BOARD MATH SCI AM
  • [8] Cox T. F., 1994, MULTIDIMENSIONAL SCA
  • [9] DESILVA V, 2003, ADV NEURAL INFORM PR, V15, P705
  • [10] GOWER JC, 1968, BIOMETRIKA, V55, P582, DOI 10.1093/biomet/55.3.582