Uncovering hierarchical structure in data using the growing hierarchical self-organizing map

被引:84
作者
Dittenbach, M [1 ]
Rauber, A [1 ]
Merkl, D [1 ]
机构
[1] Vienna Univ Technol, Inst Software Technol, A-1040 Vienna, Austria
关键词
self-organizing map (SOM); unsupervised hierarchical clustering; document classification; data mining; exploratory data analysis;
D O I
10.1016/S0925-2312(01)00655-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
Discovering the inherent structure in data has become one of the major challenges in data mining applications. It requires stable and adaptive models that are capable of handling the typically very high-dimensional feature spaces. In particular, the representation of hierarchical relations and intuitively visible cluster boundaries are essential for a wide range of data mining applications. Current approaches based on neural networks hardly fulfill these requirements within a single model. In this paper we present the growing hierarchical self-organizing map (GHSOM), a neural network model based on the self-organizing map. The main feature of this novel architecture is its capability of growing both in terms of map size as well as in a three-dimensional tree-structure in order to represent the hierarchical structure present in a data collection during an unsupervised training process. This capability, combined with the stability of the self-organizing map for high-dimensional feature space representation, makes it an ideal tool for data analysis and exploration. We demonstrate the potential of the GHSOM with an application from the information retrieval domain, which is prototypical both of the high-dimensional feature spaces frequently encountered in today's applications as well as of the hierarchical nature of data. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:199 / 216
页数:18
相关论文
共 27 条
[1]
[Anonymous], 1997, Proc. of the workshop on self-organizing maps (WSOM97)
[2]
BLACKMORE J, 1993, P IEEE INT C NEUR NE, V1, P450
[3]
Dale R., 2000, HDB NATURAL LANGUAGE, P889
[4]
Deboeck G., 1998, VISUAL EXPLORATIONS
[5]
FRITZKE B, 1995, NEURAL PROCESS LETT, V2, P1
[6]
Data clustering: A review [J].
Jain, AK ;
Murty, MN ;
Flynn, PJ .
ACM COMPUTING SURVEYS, 1999, 31 (03) :264-323
[7]
Kaski S., 1998, Neural Computing Surveys, V1
[8]
Self organization of a massive document collection [J].
Kohonen, T ;
Kaski, S ;
Lagus, K ;
Salojärvi, J ;
Honkela, J ;
Paatero, V ;
Saarela, A .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (03) :574-585
[9]
SELF-ORGANIZED FORMATION OF TOPOLOGICALLY CORRECT FEATURE MAPS [J].
KOHONEN, T .
BIOLOGICAL CYBERNETICS, 1982, 43 (01) :59-69
[10]
KOHONEN T, 1998, P ICANN98 8 INT C AR, V1, P65