Comparing rankings of search results on the Web

Cited by: 45
Authors
Bar-Ilan, J [1]
Affiliations
[1] Bar Ilan Univ, Dept Informat Sci, IL-52900 Ramat Gan, Israel
[2] Hebrew Univ Jerusalem, Sch Lib Archive & Informat Studies, IL-91904 Jerusalem, Israel
Keywords
ranking; comparison; search engines; overlap
DOI
10.1016/j.ipm.2005.03.008
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The Web has become an information source for professional data gathering. Because of the vast amount of information on almost all topics, one cannot systematically go over the whole set of results and must therefore rely on the search engine's ordering of the results. It is well known that search engines on the Web have low overlap in terms of coverage. In this study we measure how similar the rankings of search engines are on the overlapping results. We compare rankings of results for identical queries retrieved from several search engines. The method is based only on the set of URLs that appear in the answer sets of the engines being compared. To compare the rankings of two search engines, the Spearman correlation coefficient is computed; when more than two sets are compared, Kendall's W is used. These are well-known measures, and the statistical significance of the results can be computed. The methods are demonstrated on a set of 15 queries that were submitted to four large Web search engines. The findings indicate that the large public search engines on the Web employ considerably different ranking algorithms. © 2005 Elsevier Ltd. All rights reserved.
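As an illustration of the two measures named in the abstract, the following is a minimal Python sketch, not the paper's own procedure. It assumes that each engine's answer set is an ordered list of unique URLs, that only the URLs common to all compared lists are kept, and that those URLs are re-ranked 1..n by their relative order in each list (so there are no ties). The function names `relative_ranks`, `spearman_rho`, and `kendall_w` are hypothetical, and significance testing is left out.

```python
def relative_ranks(ranked_urls, common):
    """Rank the common URLs 1..n by their order in one engine's list.

    Assumes ranked_urls contains no duplicate URLs.
    """
    kept = [url for url in ranked_urls if url in common]
    return {url: position + 1 for position, url in enumerate(kept)}


def spearman_rho(list_a, list_b):
    """Spearman rank correlation on the URLs shared by two ranked lists."""
    common = set(list_a) & set(list_b)
    n = len(common)
    if n < 2:
        raise ValueError("need at least two overlapping URLs")
    ranks_a = relative_ranks(list_a, common)
    ranks_b = relative_ranks(list_b, common)
    # Sum of squared rank differences; the closed form below is valid
    # because the relative ranks contain no ties.
    d_squared = sum((ranks_a[u] - ranks_b[u]) ** 2 for u in common)
    return 1 - 6 * d_squared / (n * (n * n - 1))


def kendall_w(ranked_lists):
    """Kendall's coefficient of concordance W for m >= 2 ranked lists."""
    m = len(ranked_lists)
    if m < 2:
        raise ValueError("need at least two ranked lists")
    common = set(ranked_lists[0]).intersection(*ranked_lists[1:])
    n = len(common)
    if n < 2:
        raise ValueError("need at least two overlapping URLs")
    all_ranks = [relative_ranks(lst, common) for lst in ranked_lists]
    # Total rank each URL receives across all engines.
    totals = [sum(ranks[u] for ranks in all_ranks) for u in common]
    mean_total = m * (n + 1) / 2
    s = sum((t - mean_total) ** 2 for t in totals)
    return 12 * s / (m * m * (n ** 3 - n))
```

With these assumptions, `spearman_rho` ranges from -1 (the overlapping URLs appear in opposite order) to 1 (identical order), and `kendall_w` ranges from 0 (no agreement) to 1 (all engines order the overlap identically).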
Pages: 1511-1519
Page count: 9
Related papers
22 in total
[1]  
BHARAT K, 1998, P 7 INT WORLD WID WE, V30, P379
[2]  
BOVE RE, 2002, CORRELATION
[3]  
Cohen J., 1988, STAT POWER ANAL BEHA
[4]   Comparing top k lists [J].
Fagin, R ;
Kumar, R ;
Sivakumar, D .
SIAM JOURNAL ON DISCRETE MATHEMATICS, 2003, 17 (01) :134-160
[5]  
GARSON D, 2004, QUALITATIVE METHODS
[6]  
*GOOGLE, 2004, INF WEBM
[7]   Measuring search engine quality [J].
Hawking, D ;
Craswell, N ;
Bailey, P ;
Griffiths, K .
INFORMATION RETRIEVAL, 2001, 4 (01) :33-59
[8]   Use of electronic resources in scholarly electronic journals: A citation analysis [J].
Herring, SD .
COLLEGE & RESEARCH LIBRARIES, 2002, 63 (04) :334-340
[9]   Free online availability substantially increases a paper's impact [J].
Lawrence, S .
NATURE, 2001, 411 (6837) :521-521
[10]   Accessibility of information on the web [J].
Lawrence, S ;
Giles, CL .
NATURE, 1999, 400 (6740) :107-109