A Survey of Automatic Query Expansion in Information Retrieval

Cited by: 518
Authors
Carpineto, Claudio [1]
Romano, Giovanni [1]
Affiliations
[1] Fondazione Ugo Bordoni, I-00142 Rome, Italy
Keywords
Algorithms; Experimentation; Measurement; Performance; Query expansion; query refinement; search; word associations; pseudo-relevance feedback; document ranking; RELEVANCE FEEDBACK; SEARCH; WEB; PERFORMANCE; DICTIONARY; CONTEXT; RULES
DOI
10.1145/2071389.2071390
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
The relative ineffectiveness of information retrieval systems is largely caused by the inaccuracy with which a query formed by a few keywords models the actual user information need. One well-known method to overcome this limitation is automatic query expansion (AQE), whereby the user's original query is augmented by new features with a similar meaning. AQE has a long history in the information retrieval community, but it is only in recent years that it has reached a level of scientific and experimental maturity, especially in laboratory settings such as TREC. This survey presents a unified view of a large number of recent approaches to AQE that leverage various data sources and employ very different principles and techniques. The following questions are addressed. Why is query expansion so important to improve search effectiveness? What are the main steps involved in the design and implementation of an AQE component? What approaches to AQE are available and how do they compare? Which issues must still be resolved before AQE becomes a standard component of large operational information retrieval systems (e.g., search engines)?
Pages: 50