Marginal median SOM for document organization and retrieval

Cited by: 13
Authors
Georgakis, A [1]
Kotropoulos, C [1]
Xafopoulos, A [1]
Pitas, I [1]
Affiliations
[1] Aristotle Univ Thessaloniki, Dept Informat, Artificial Intelligence & Informat Anal Lab, GR-54124 Thessaloniki, Greece
Keywords
self-organizing maps; order statistics; marginal median
DOI
10.1016/j.neunet.2003.08.008
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The self-organizing map algorithm has been used successfully in document organization. We now propose using the same algorithm for document retrieval. Moreover, we test the performance of the self-organizing map when the linear Least Mean Squares adaptation rule is replaced with the marginal median. We present two implementations of this variant of the self-organizing map: one quantizes the real-valued feature vectors to integer-valued ones, and the other does not. Experiments performed using both implementations demonstrate superior performance over the standard self-organizing map based method in terms of the number of training iterations needed for the mean square error (i.e. the average distortion) to drop to e^{-1} ≈ 36.788% of its initial value. Furthermore, the performance of a document organization and retrieval system employing the self-organizing map architecture and its variant is assessed using the average recall-precision curves evaluated on two corpora: the first comprises manually selected web pages from the Internet with touristic content, and the second is Reuters-21578, Distribution 1.0. © 2004 Elsevier Ltd. All rights reserved.
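The marginal median update mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a simplified batch pass with no neighborhood function, in which each reference vector is replaced by the coordinate-wise median of the feature vectors assigned to it. The function names (`marginal_median`, `winner`, `mm_som_epoch`, `mean_square_error`) and the toy data are hypothetical and chosen only for illustration.

```python
import numpy as np


def marginal_median(samples):
    """Coordinate-wise (marginal) median of an (n, d) array of vectors."""
    return np.median(samples, axis=0)


def winner(weights, x):
    """Index of the reference vector closest to x in Euclidean distance."""
    return int(np.argmin(np.linalg.norm(weights - x, axis=1)))


def mm_som_epoch(weights, data):
    """One simplified batch pass (assumption: no neighborhood update).

    Each sample is assigned to its winner neuron; each neuron that won at
    least one sample is moved to the marginal median of its samples.
    """
    assignments = np.array([winner(weights, x) for x in data])
    updated = weights.copy()
    for j in range(len(weights)):
        members = data[assignments == j]
        if len(members) > 0:
            updated[j] = marginal_median(members)
    return updated


def mean_square_error(weights, data):
    """Average squared distance between each sample and its winner."""
    errors = [np.sum((x - weights[winner(weights, x)]) ** 2) for x in data]
    return float(np.mean(errors))


# Toy data (hypothetical): two loose groups of 2-D feature vectors.
data = np.array([[0.0, 0.0], [1.0, 1.0], [2.0, 5.0],
                 [10.0, 10.0], [11.0, 11.0], [12.0, 12.0]])
weights = np.array([[0.0, 0.0], [10.0, 10.0]])

mse_before = mean_square_error(weights, data)
weights_after = mm_som_epoch(weights, data)
mse_after = mean_square_error(weights_after, data)
```

A training loop built on this sketch could stop once `mse_after / mse_initial` falls below `np.exp(-1)`, which corresponds to the convergence criterion quoted in the abstract. Because the median is taken per coordinate, a single outlying component in one sample shifts the reference vector less than a mean-based update would.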
Pages: 365-377
Number of pages: 13
Related papers
26 in total
[11]  
Kaski S, 1998, IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, P413, DOI 10.1109/IJCNN.1998.682302
[12]   Self organization of a massive document collection [J].
Kohonen, T ;
Kaski, S ;
Lagus, K ;
Salojärvi, J ;
Honkela, J ;
Paatero, V ;
Saarela, A .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (03) :574-585
[13]   The self-organizing map [J].
Kohonen, T .
NEUROCOMPUTING, 1998, 21 (1-3) :1-6
[14]  
KOHONEN T, 1999, SELF ORG MASSIVE TEX, P171
[15]  
Kohonen T., 1997, Self-organizing Maps, V2nd ed.
[16]  
KOHONEN T, 1998, P ICANN98 8 INT C AR, V1, P65
[17]  
KORFHAGE RR, 1997, INFORMATION STORAGE
[18]   Text retrieval using self-organized document maps [J].
Lagus, K .
NEURAL PROCESSING LETTERS, 2002, 15 (01) :21-29
[19]  
Lehmann E., 1983, Theory of Point Estimation
[20]  
Lewis D., 1997, Reuters-21578 text categorization test collection, distribution 1.0