Coverage, relevance, and ranking: The impact of query operators on web search engine results

被引:43
作者
Eastman, CM [1 ]
Jansen, BJ
机构
[1] Univ S Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA
[2] Penn State Univ, Sch Informat Sci & Technol, University Pk, PA 16801 USA
关键词
human factors; performance; experimentation; Relative precision; coverage; ranking; Boolean operators; query operators; search engines; Web results;
D O I
10.1145/944012.944015
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Research has reported that about 10% of Web searchers utilize advanced query operators, with the other 90% using extremely simple queries. It is often assumed that the use of query operators, such as Boolean operators and phrase searching, improves the effectiveness of Web searching. We test this assumption by examining the effects of query operators on the performance of three major Web search engines. We selected one hundred queries from the transaction log of a Web search service. Each of these original queries contained query operators such as AND, OR, MUST APPEAR (+), or PHRASE (" "). We then removed the operators from these one hundred advanced queries. We submitted both the original and modified queries to three major Web search engines; a total of 600 queries were submitted and 5,748 documents evaluated. We compared the results from the original queries with the operators to the results from the modified queries without the operators. We examined the results for changes in coverage, relative precision, and ranking of relevant documents. The use of most query operators had no significant effect on coverage, relative precision, or ranking, although the effect varied depending on the search engine. We discuss implications for the effectiveness of searching techniques as currently taught, for future information retrieval system design, and for future research.
引用
收藏
页码:383 / 411
页数:29
相关论文
共 64 条
[11]  
COOPER WS, 1968, AM DOC, V19, P355
[12]  
CRASWELL N, 2001, P 24 ANN INT ACM SIG, P250
[13]  
Cronen-Townsend S., 2002, Proceedings of SIGIR 2002. Twenty-Fifth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P299
[14]  
DEERWESTER S, 1990, J AM SOC INFORM SCI, V41, P391, DOI 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO
[15]  
2-9
[16]  
DING W, 1996, P 59 ANN M AM SOC IN, P136
[17]  
DUMAIS C, 2001, P ACM SIGCHI C HUM F, P277
[18]  
DUMAIS S, 2002, 11 INT WORLD WID WEB
[19]   30,000 hits may be better than 300: Precision anomalies in Internet searches [J].
Eastman, CM .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2002, 53 (11) :879-882
[20]   Web search strategies and approaches to studying [J].
Ford, N ;
Miller, D ;
Moss, N .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2003, 54 (06) :473-489