Regularized principal manifolds

被引:47
作者
Smola, AJ
Mika, S
Schölkopf, B
Williamson, RC
机构
[1] Australian Natl Univ, Dept Engn & RSISE, Canberra, ACT 0200, Australia
[2] GMD FIRST, D-12489 Berlin, Germany
[3] Australian Natl Univ, Dept Engn, Canberra, ACT 0200, Australia
关键词
regularization; uniform convergence; kernels; entropy numbers; principal curves; clustering; generative topographic map; support vector machines; kernel PCA;
D O I
10.1162/15324430152748227
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many settings of unsupervised learning can be viewed as quantization problems - the minimization of the expected quantization error subject to some restrictions. This allows the use of tools such as regularization from the theory of (supervised) risk minimization for unsupervised learning. This setting turns out to be closely related to principal curves, the generative topographic map, and robust coding. We explore connection in two ways: (1) we propose an algorithm for finding principal manifolds that can be regularized in a variety of ways; and (2) we derive uniform convergence bounds and hence bounds on the learning rates of the algorithm. In particular, we give bounds on the covering numbers which allows us to obtain nearly optimal learning rates for certain types of regularization operators. Experimental results demonstrate the feasibility of the approach.
引用
收藏
页码:179 / 209
页数:31
相关论文
共 45 条
[1]  
AIZERMAN MA, 1965, AUTOMAT REM CONTR+, V25, P821
[2]  
[Anonymous], P 2 INT C COMP VIS
[3]  
[Anonymous], 1982, ESTIMATION DEPENDENC
[4]  
ANTHONY M, 1999, THEORY LEARNING ARTI
[5]   The minimax distortion redundancy in empirical quantizer design [J].
Bartlett, PL ;
Linder, T ;
Lugosi, G .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (05) :1802-1813
[6]  
BENNETT K, 1999, ADV KERNEL METHOS SV, P37
[7]   TRAINING WITH NOISE IS EQUIVALENT TO TIKHONOV REGULARIZATION [J].
BISHOP, CM .
NEURAL COMPUTATION, 1995, 7 (01) :108-116
[8]   GTM: The generative topographic mapping [J].
Bishop, CM ;
Svensen, M ;
Williams, CKI .
NEURAL COMPUTATION, 1998, 10 (01) :215-234
[9]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[10]  
Bradley P.S., 1998, 9805 U WISC MAD