Combining multiple strategies for effective monolingual and cross-language retrieval

Cited by: 25
Authors
Savoy, J [1]
Affiliation
[1] Univ Neuchatel, Inst Interfac Informat, CH-2000 Neuchatel, Switzerland
Source
INFORMATION RETRIEVAL | 2004 / Vol. 7 / Issue 1-2
Keywords
cross-language information retrieval; bilingual information retrieval; collection merging strategies; evaluation; learning curve;
DOI
10.1023/B:INRT.0000009443.51912.e7
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper describes and evaluates different retrieval strategies that are useful for search operations on document collections written in various European languages, namely French, Italian, Spanish and German. We also suggest and evaluate different query translation schemes based on freely available translation resources. In order to cross language barriers, we propose a combined query translation approach that has resulted in interesting retrieval effectiveness. Finally, we suggest a collection merging strategy based on logistic regression that tends to perform better than other merging approaches.
Pages: 121-148
Page count: 28