HDGSOMr:: A high dimensional growing self-organizing map using randomness for efficient web and text mining

被引:5
作者
Amarasiri, R [1 ]
Alahakoon, D [1 ]
Smith, K [1 ]
Premaratne, M [1 ]
机构
[1] Monash Univ, Sch Business Syst, Clayton, Vic 3168, Australia
来源
2005 IEEE/WIC/ACM International Conference on Web Intelligence, Proceedings | 2005年
关键词
D O I
10.1109/WI.2005.70
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining of text data from the web has become a necessity, in modern days due to the volumes of data available on the web. While searching for information on the web using search engines is popular, to analyze the content on large collections of web pages, feature map techniques are still popular. One of the problems associated with processing large collections of text data from the web using feature map techniques is the time taken to cluster them. This paper presents an algorithm based on a growing variant of the Self Organizing Map called the HDGSOMr. This novel algorithm incorporates randonmess into the self-organizing process to produce higher quality clusters within few epochs and utilizing smaller neighborhood sizes resulting in a significant reduction in overall processing time. Details of the HDGSOMr algorithm and results of processing large collections of text data proving the efficiency of the algorithm are also presented.
引用
收藏
页码:215 / 221
页数:7
相关论文
共 22 条
  • [1] Dynamic self-organizing maps with controlled growth for knowledge discovery
    Alahakoon, D
    Halgamuge, SK
    Srinivasan, B
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (03): : 601 - 614
  • [2] AMARASIRI R, 2004, INT INF TECHN C IITC
  • [3] AMARASIRI R, 2004, HYBR INT SYST 2004
  • [4] AMARASIRI R, 2005, INT J HYBRID INTELLI
  • [5] [Anonymous], TAMKANG J SCI ENG
  • [6] BLACKMORE J, 1993, IEEE INT C NEUR NETW
  • [7] CHIRAPHADHANAKU.S, 1997, 1997 IASTED INT C IN
  • [8] ETZONI O, 1996, COMMUN ACM, V39, P65
  • [9] GROWING GRID - A SELF-ORGANIZING NETWORK WITH CONSTANT NEIGHBORHOOD RANGE AND ADAPTATION STRENGTH
    FRITZKE, B
    [J]. NEURAL PROCESSING LETTERS, 1995, 2 (05) : 9 - 13
  • [10] Holland J. H., 1973, SIAM Journal on Computing, V2, P88, DOI 10.1137/0202009