Adapting information retrieval to query contexts

被引:23
作者
Bai, Jing [1 ]
Nie, Jian-Yun [1 ]
机构
[1] Univ Montreal, DIRO, Montreal, PQ H3C 3J7, Canada
关键词
Information retrieval; Context; Context-dependent relation; Query expansion; Language modeling;
D O I
10.1016/j.ipm.2008.07.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In current IR approaches documents are retrieved only according to the terms specified in the query. The same answers are returned for the same query whatever the user and the search goal are. In reality, many other contextual factors strongly influence document's relevance and they should be taken into account in IR operations. This paper proposes a method, based on language modeling, to integrate several contextual factors so that document ranking will be adapted to the specific query contexts. We will consider three contextual factors in this paper: the topic domain of the query, the characteristics of the document collection, as well as context words within the query. Each contextual factor is used to generate a new query language model to specify some aspect of the information need. All these query models are then combined together to produce a more complete model for the underlying information need. Our experiments on TREC collections show that each contextual factor can positively influence the IR effectiveness and the combined model results in the highest effectiveness. This study shows that it is both beneficial and feasible to integrate more contextual factors in the current IR practice. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1901 / 1922
页数:22
相关论文
共 42 条
[1]  
[Anonymous], P 27 INT ACM SIGIR C
[2]  
[Anonymous], 1996, P 19 ANN INT ACM SIG, DOI DOI 10.1145/243199.243202
[3]  
[Anonymous], P 24 ANN INT ACM SIG, DOI DOI 10.1145/383952.384019
[4]  
BAI J, 2004, AS INF RETR S AIRS
[5]  
BAI J, 2007, P 30 ANN INT ACM SIG, P15
[6]  
Berger A, 1999, SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, P222, DOI 10.1145/312624.312681
[7]  
CAO G., 2005, P 28 ANN INT ACM SIG, P298, DOI DOI 10.1145/1076034.1076086
[8]  
Chengxiang Zhai, 2001, Proceedings of the 2001 ACM CIKM. Tenth International Conference on Information and Knowledge Management, P403, DOI 10.1145/502585.502654
[9]  
CROFT WB, 2006, DELOS NSF WORKSH PER, P49
[10]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38