Improving the effectiveness of information retrieval with local context analysis

Cited by: 301
Authors
Xu, JX
Croft, WB
Affiliations
[1] BBN Technol, Cambridge, MA 02138 USA
[2] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
Funding
UK Medical Research Council;
Keywords
experimentation; performance; cooccurrence; document analysis; feedback; global techniques; information retrieval; local context analysis; local techniques;
DOI
10.1145/333135.333138
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Techniques for automatic query expansion have been extensively studied in information retrieval research as a means of addressing the word mismatch between queries and documents. These techniques can be categorized as either global or local. While global techniques rely on analysis of a whole collection to discover word relationships, local techniques emphasize analysis of the top-ranked documents retrieved for a query. While local techniques have been shown to be more effective than global techniques in general, existing local techniques are not robust and can seriously hurt retrieval when few of the retrieved documents are relevant. We propose a new technique, called local context analysis, which selects expansion terms based on cooccurrence with the query terms within the top-ranked documents. Experiments on a number of collections, both English and non-English, show that local context analysis offers more effective and consistent retrieval results.
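The core idea in the abstract, choosing expansion terms by how often they cooccur with query terms in the top-ranked documents, can be illustrated with a minimal Python sketch. This is not the scoring formula from the paper: the function name `expansion_terms`, the whitespace tokenizer, and the simple count-based score are all assumptions made for illustration only.

```python
from collections import Counter


def expansion_terms(query_terms, top_docs, k=3):
    """Rank candidate expansion terms by cooccurrence with query terms.

    Simplified, hypothetical scoring (NOT the paper's formula): each
    top-ranked document adds to every non-query term it contains a credit
    equal to the number of distinct query terms in that document.
    """
    query = {t.lower() for t in query_terms}
    scores = Counter()
    for doc in top_docs:
        tokens = set(doc.lower().split())  # naive whitespace tokenizer
        matched = len(query & tokens)
        if matched == 0:
            continue  # documents without any query term contribute nothing
        for term in tokens - query:
            scores[term] += matched
    # Sort by descending score, then alphabetically so ties are deterministic.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


docs = [
    "query expansion improves retrieval",
    "expansion of a query with feedback",
    "pasta recipes",
]
print(expansion_terms(["query", "retrieval"], docs, k=2))
```

In this toy setup, terms that appear only in documents sharing no term with the query never become candidates, which loosely mirrors the abstract's emphasis on cooccurrence with the query terms rather than mere frequency in the retrieved set.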
Pages: 79-112
Page count: 34