Web Retrieval: Techniques for the Aggregation and Selection of Queries and Answers

被引:5
作者
Nettleton, David [1 ]
Baeza-Yates, Ricardo [1 ,2 ]
机构
[1] Univ Pompeu Fabra, Dept Technol, Barcelona 08003, Spain
[2] Yahoo Res, Barcelona 08018, Spain
关键词
D O I
10.1002/int.20316
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a set of techniques for grouping and aggregating queries and search results, in the context of an Internet "search engine." (1) In the case of the initial grouping of the queries. we consider the Fuzzy c-Means(1) and Kohonen SOM2 techniques. It is proposed that FCM may be more adequate than k-Means(3) for the grouping of certain data types. We evaluate how we can use FCM to calculate the fuzzy membership grades for a set of Web queries and their corresponding results. (2) In the case of the aggregation of data from different information sources (the clustering techniques), we will consider weighted ordered weighted averaging (WOWA).(4) WOWA is used to choose the most adequate cluster and to identify the historical query in that cluster, which is most similar to a new query. We will see that the WOWA operator offers a wide flexibility for data processing. (C) 2008 Wiley Periodicals, Inc.
引用
收藏
页码:1223 / 1234
页数:12
相关论文
共 19 条
[1]  
AHN IJ, 2005, PAI 2005, P24
[2]   Modeling user search behavior [J].
Baeza-Yates, R ;
Hurtado, C ;
Mendoza, M ;
Dupret, G .
THIRD LATIN AMERICAN WEB CONGRESS, PROCEEDINGS, 2005, :242-251
[3]  
BAEZAYATES R, 2005, ECIR 05
[4]  
BAEZAYATES R, CHIL COMP SCI C SANT
[5]  
BAEZAYATES R, 2001, 10 WORLD WID WEB C H
[6]  
BEZDEK JC, 1981, S13 FEATURE SELECTIO
[7]  
Broder A., 2002, SIGIR Forum, V36, P3, DOI 10.1145/792550.792552
[8]  
Hartigan J. A., 1979, APPLIED STATISTICS, V28, P126
[9]  
Kohonen, 1984, SELF ORG ASS MEMORY
[10]  
LEE U, 2005, P 14 INT WORLD WID W