Self-organizing maps: Generalizations and new optimization techniques

被引:83
作者
Graepel, T [1 ]
Burger, M [1 ]
Obermayer, K [1 ]
机构
[1] Tech Univ Berlin, Dept Comp Sci, D-10587 Berlin, Germany
关键词
topographic mapping; deterministic annealing; EM algorithm; kernel-based; proximity data;
D O I
10.1016/S0925-2312(98)00035-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We offer three algorithms for the generation of topographic mappings to the practitioner of unsupervised data analysis. The algorithms are each based on the minimization of a cost function which is performed using an EM algorithm and deterministic annealing. The soft topographic vector quantization algorithm (STVQ) - like the original self-organizing map (SOM) - provides a tool for the creation of self-organizing maps of Euclidean data. Its optimization scheme, however, offers an alternative to the heuristic stepwise shrinking of the neighborhood width in the SOM and makes it possible to use a fixed neighborhood function solely to encode desired neighborhood relations between nodes. The kernel-based soft topographic mapping (STMK) is a generalization of STVQ and introduces new distance measures in data space, based on kernel functions. Using the new distance measures corresponds to performing the STVQ in a high-dimensional feature space, which is related to data space by a nonlinear mapping. This preprocessing can reveal structure in the data which may go unnoticed if the STVQ is performed in the standard Euclidean space. The soft topographic mapping for proximity data (STMP) is another generalization of STVQ that enables the user to generate topographic maps for data which are given in terms of pairwise proximities. It thus offers a flexible alternative to multidimensional scaling methods and opens up a new range of applications for SOMs. Both STMK and STMP share the robust optimization properties of STVQ due to the application of deterministic annealing. In our contribution we discuss the algorithms together with their implementation and provide detailed pseudo-code and explanations. (C) 1998 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:173 / 190
页数:18
相关论文
共 36 条
[1]  
Aizerman M., 1964, AUTOMAT REM CONTR, V25, P821, DOI DOI 10.1234/12345678
[2]  
[Anonymous], 1988, SELF ORG ASS MEMORY, DOI DOI 10.1007/978-3-662-00784-6_10
[3]   VECTOR QUANTIZATION WITH COMPLEXITY COSTS [J].
BUHMANN, J ;
KUHNEL, H .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1993, 39 (04) :1133-1145
[4]  
Burger M, 1998, ADV NEUR IN, V10, P430
[5]  
BURGER M, 1997, ICANN 97, V1327, P619
[6]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[7]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[8]   MODELS OF ORIENTATION AND OCULAR DOMINANCE COLUMNS IN THE VISUAL-CORTEX - A CRITICAL COMPARISON [J].
ERWIN, E ;
OBERMAYER, K ;
SCHULTEN, K .
NEURAL COMPUTATION, 1995, 7 (03) :425-468
[9]  
GOLD S, 1995, ADV NEURAL INFORMATI, V7, P713
[10]   Phase transitions in stochastic self-organizing maps [J].
Graepel, T ;
Burger, M ;
Obermayer, K .
PHYSICAL REVIEW E, 1997, 56 (04) :3876-3890