CAS based clustering algorithm for Web users

被引:17
作者
Wan, Miao [1 ,2 ,3 ]
Li, Lixiang [1 ,2 ,3 ]
Xiao, Jinghua [1 ,4 ]
Yang, Yixian [1 ,2 ,3 ]
Wang, Cong [1 ,2 ,3 ]
Guo, Xiaolei [1 ,2 ,3 ]
机构
[1] Beijing Univ Posts & Telecommun, Informat Secur Ctr, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Key Lab Network & Informat Attack & Def Technol M, Beijing 100876, Peoples R China
[3] Beijing Univ Posts & Telecommun, Natl Engn Lab Disaster Backup & Recovery, Beijing 100876, Peoples R China
[4] Beijing Univ Posts & Telecommun, Sch Sci, Beijing 100876, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Clustering; Chaotic ant swarm (CAS); Web access logs; Web user clustering; SWARM OPTIMIZATION; ANT; PERSONALIZATION; CHAOS;
D O I
10.1007/s11071-010-9653-2
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
This article devises a clustering technique for detecting groups of Web users from Web access logs. In this technique, Web users are clustered by a new clustering algorithm which uses the mechanism analysis of chaotic ant swarm (CAS). This CAS based clustering algorithm is called as CAS-C and it solves clustering problems from the perspective of chaotic optimization. The performance of CAS-C for detecting Web user clusters is compared with the popular clustering method named k-means algorithm. Clustering qualities are evaluated via calculating the average intra-cluster and inter-cluster distance. Experimental results demonstrate that CAS-C is an effective clustering technique with larger average intra-cluster distance and smaller average inter-cluster distance than k-means algorithm. The statistical analysis of resulted distances also proves that the CAS-C based Web user clustering algorithm has better stability. In order to show the utility, the proposed approach is applied to a pre-fetching task which predicts user requests with encouraging results.
引用
收藏
页码:347 / 361
页数:15
相关论文
共 41 条
  • [1] CHAOTIC NEURAL NETWORKS
    AIHARA, K
    TAKABE, T
    TOYODA, M
    [J]. PHYSICS LETTERS A, 1990, 144 (6-7) : 333 - 340
  • [2] Anderson CR, 2002, THESIS U WASHINGTON
  • [3] [Anonymous], 2000, THESIS U MINNESOTA
  • [4] [Anonymous], 1995, TR95010 BOST U DEP C
  • [5] ANSARI S, 2000, P 2001 IEEE INT C DA, V1, P27
  • [6] Using site semantics to analyze, visualize, and support navigation
    Berendt, B
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2002, 6 (01) : 37 - 59
  • [7] Bin Lan, 2000, Proceedings of the Ninth International Conference on Information and Knowledge Management. CIKM 2000, P504, DOI 10.1145/354756.354859
  • [8] Chaotic ant swarm optimization to economic dispatch
    Cai, Jiejin
    Ma, Xiaoqian
    Li, Lixiang
    Yang, Yixian
    Peng, Haipeng
    Wang, Xiangdong
    [J]. ELECTRIC POWER SYSTEMS RESEARCH, 2007, 77 (10) : 1373 - 1380
  • [9] Chaotic particle swarm optimization for economic dispatch considering the generator constraints
    Cai Jiejin
    Ma Xiaoqian
    Li Lixiang
    Peng Haipeng
    [J]. ENERGY CONVERSION AND MANAGEMENT, 2007, 48 (02) : 645 - 653
  • [10] CHARACTERIZING BROWSING STRATEGIES IN THE WORLD-WIDE-WEB
    CATLEDGE, LD
    PITKOW, JE
    [J]. COMPUTER NETWORKS AND ISDN SYSTEMS, 1995, 27 (06): : 1065 - 1073