A Survey of Automatic Query Expansion in Information Retrieval

被引:518
作者
Carpineto, Claudio [1 ]
Romano, Giovanni [1 ]
机构
[1] Fdn Ugo Bordoni, I-00142 Rome, Italy
关键词
Algorithms; Experimentation; Measurement; Performance; Query expansion; query refinement; search; word associations; pseudo-relevance feedback; document ranking; RELEVANCE FEEDBACK; SEARCH; WEB; PERFORMANCE; DICTIONARY; CONTEXT; RULES;
D O I
10.1145/2071389.2071390
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The relative ineffectiveness of information retrieval systems is largely caused by the inaccuracy with which a query formed by a few keywords models the actual user information need. One well known method to overcome this limitation is automatic query expansion (AQE), whereby the user's original query is augmented by new features with a similar meaning. AQE has a long history in the information retrieval community but it is only in the last years that it has reached a level of scientific and experimental maturity, especially in laboratory settings such as TREC. This survey presents a unified view of a large number of recent approaches to AQE that leverage various data sources and employ very different principles and techniques. The following questions are addressed. Why is query expansion so important to improve search effectiveness? What are the main steps involved in the design and implementation of an AQE component? What approaches to AQE are available and how do they compare? Which issues must still be resolved before AQE becomes a standard component of large operational information retrieval systems (e.g., search engines)?
引用
收藏
页数:50
相关论文
共 208 条
[1]  
AGICHTEIN E., 2004, ACM T INTERNET TECHN, V4, P1299
[2]  
AGIRRE E., 2009, P CLEF
[3]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[4]  
Allan J., 1996, SIGIR Forum, P270
[5]  
Amati G, 2003, LECT NOTES COMPUT SC, V3237, P310
[6]  
Amati G, 2004, LECT NOTES COMPUT SC, V2997, P127
[7]  
Amati G., 2001, 10 TEXT RETRIEVAL C, P182
[8]  
Amati G., 2003, Probability models for information retrieval based on divergence from randomness
[9]   A SPREADING ACTIVATION THEORY OF MEMORY [J].
ANDERSON, JR .
JOURNAL OF VERBAL LEARNING AND VERBAL BEHAVIOR, 1983, 22 (03) :261-295
[10]  
[Anonymous], 2008, P 31 ANN INT ACM SIG, DOI DOI 10.1145/1390334.1390377