Reduction of the dimension of a document space using the fuzzified output of a Kohonen network

被引:13
作者
Guerrero, VP
Anegón, FD
机构
[1] Univ Extremadura, Fac Lib & Informat Sci, Badajoz 06011, Spain
[2] Univ Granada, Fac Lib & Informat Sci, E-18071 Granada, Spain
来源
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2001年 / 52卷 / 14期
关键词
D O I
10.1002/asi.1189
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The vectors used in IR, whether to represent the documents or the terms, are high dimensional, and their dimensions increase as one approaches real problems. The algorithms used to manipulate them, however consume enormously increasing amounts of computational capacity as the said dimension grows. We used the Kohonen algorithm and a fuzzification module to perform a fuzzy clustering of the terms. The degrees of membership obtained were used to represent the terms and, by extension, the documents, yielding a smaller number of components but still endowed with meaning. To test the results, we use a topological classification of sets of transformed and untransformed vectors to check that the same structure underlies both.
引用
收藏
页码:1234 / 1241
页数:8
相关论文
共 36 条
[1]  
ANEGON FM, 1999, REPRESENTACION ORG C, P151
[2]  
ANEGOU FM, 1994, SISTEMAS INTEGRADOS
[3]  
[Anonymous], 1984, SELF ORG ASS MEMORY
[4]  
[Anonymous], Pattern Recognition With Fuzzy Objective Function Algorithms
[5]  
BEZDEK JC, 1992, IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, P1035, DOI 10.1109/FUZZY.1992.258797
[6]  
BOTE G, 1997, THESIS U GRANADA SPA
[7]  
BOTE VPG, 2001, INFORMATION PROCESSI, V38, P79
[8]  
Chen HC, 1998, J AM SOC INFORM SCI, V49, P582, DOI 10.1002/(SICI)1097-4571(1998)49:7<582::AID-ASI2>3.0.CO
[9]  
2-V
[10]  
ELHAMDOUCHI A, 1989, INFORMATION PROCESSI, V24, P17