Clustering e-commerce search engines based on their search interface pages using WISE-Cluster

被引:8
作者
Lu, Yiyao
He, Hai
Peng, Qian
Meng, Weiyi [1 ]
Yu, Clement
机构
[1] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA
[2] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
基金
美国国家科学基金会;
关键词
e-commerce; search engine; clustering; categorization; Web-based information systems;
D O I
10.1016/j.datak.2006.01.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a new approach to clustering e-commerce search engines (ESEs) on the Web. Our approach utilizes the features available on the interface page of each ESE, including the label terms and value terms appearing in the search form, the number of images, normalized price terms as well as other terms. The experimental results based on more than 400 ESEs indicate that the proposed approach has good clustering accuracy. The importance of different types of features is analyzed and the terms in the search form are the most important feature in obtaining quality clusters. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:231 / 246
页数:16
相关论文
共 19 条
[1]  
[Anonymous], P ICML 97
[2]  
[Anonymous], P ACM INT C MAN DAT, DOI DOI 10.1145/872757.872784
[3]  
[Anonymous], 2004, P 13 ACM C INF KNOWL
[4]   Syntactic clustering of the Web [J].
Broder, AZ ;
Glassman, SC ;
Manasse, MS ;
Zweig, G .
COMPUTER NETWORKS AND ISDN SYSTEMS, 1997, 29 (8-13) :1157-1166
[5]  
Cope J., 2003, P 14 AUSTR DAT C, V17, P181
[6]   INTRODUCTION TO MODERN INFORMATION-RETRIEVAL - SALTON,G, MCGILL,M [J].
DILLON, M .
INFORMATION PROCESSING & MANAGEMENT, 1983, 19 (06) :402-403
[7]  
Doorenbos R. B., 1997, Proceedings of the First International Conference on Autonomous Agents, P39, DOI 10.1145/267658.267666
[8]  
Goldberg D.E., 1989, OPTIMIZATION MACHINE
[9]  
He H, 2005, LECT NOTES COMPUT SC, V3806, P29
[10]  
He H., 2003, P 29 INT C VER LARG, V29, P357