Coverage, relevance, and ranking: The impact of query operators on web search engine results

被引:43
作者
Eastman, CM [1 ]
Jansen, BJ
机构
[1] Univ S Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA
[2] Penn State Univ, Sch Informat Sci & Technol, University Pk, PA 16801 USA
关键词
human factors; performance; experimentation; Relative precision; coverage; ranking; Boolean operators; query operators; search engines; Web results;
D O I
10.1145/944012.944015
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Research has reported that about 10% of Web searchers utilize advanced query operators, with the other 90% using extremely simple queries. It is often assumed that the use of query operators, such as Boolean operators and phrase searching, improves the effectiveness of Web searching. We test this assumption by examining the effects of query operators on the performance of three major Web search engines. We selected one hundred queries from the transaction log of a Web search service. Each of these original queries contained query operators such as AND, OR, MUST APPEAR (+), or PHRASE (" "). We then removed the operators from these one hundred advanced queries. We submitted both the original and modified queries to three major Web search engines; a total of 600 queries were submitted and 5,748 documents evaluated. We compared the results from the original queries with the operators to the results from the modified queries without the operators. We examined the results for changes in coverage, relative precision, and ranking of relevant documents. The use of most query operators had no significant effect on coverage, relative precision, or ranking, although the effect varied depending on the search engine. We discuss implications for the effectiveness of searching techniques as currently taught, for future information retrieval system design, and for future research.
引用
收藏
页码:383 / 411
页数:29
相关论文
共 64 条
[1]  
[Anonymous], P 4 WORLD MULT SYST
[2]  
[Anonymous], 1994, MANAGING GIGABYTES C
[3]  
*AOL, 2003, GETT START
[4]  
Borgman CL, 1996, J AM SOC INFORM SCI, V47, P493, DOI 10.1002/(SICI)1097-4571(199607)47:7<493::AID-ASI3>3.0.CO
[5]  
2-P
[6]  
Brin S, 1999, LECT NOTES COMPUT SC, V1590, P172
[7]   Predicate rewriting for translating Boolean queries in a heterogeneous information system [J].
Chang, CCK ;
García-Molina, H ;
Paepcke, A .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1999, 17 (01) :1-39
[8]  
CHOWDHURY A, 2002, P IEEE 3 INT C INF T, P8
[9]  
CLARK P, 2001, NET EC, V2, P1
[10]   Shortest-substring retrieval and ranking [J].
Clarke, CLA ;
Cormack, G .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2000, 18 (01) :44-78