Retrieval Effectiveness of Machine Translated Queries

被引:8
作者
Dolamic, Ljiljana [1 ]
Savoy, Jacques [1 ]
机构
[1] Univ Neuchatel, Dept Comp Sci, CH-2009 Neuchatel, Switzerland
来源
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2010年 / 61卷 / 11期
关键词
LANGUAGE INFORMATION-RETRIEVAL; MODELS; WEB;
D O I
10.1002/asi.21337
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article describes and evaluates various information retrieval models used to search document collections written in English through submitting queries written in various other languages, either members of the Indo-European family (English, French, German, and Spanish) or radically different language groups such as Chinese. This evaluation method involves searching a rather large number of topics (around 300) and using two commercial machine translation systems to translate across the language barriers. In this study, mean average precision is used to measure variances in retrieval effectiveness when a query language differs from the document language. Although performance differences are rather large for certain languages pairs, this does not mean that bilingual search methods are not commercially viable. Causes of the difficulties incurred when searching or during translation are analyzed and the results of concrete examples are explained.
引用
收藏
页码:2266 / 2273
页数:8
相关论文
共 28 条
[1]   Probabilistic models of information retrieval based on measuring the divergence from randomness [J].
Amati, G ;
Van Rijsbergen, CJ .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2002, 20 (04) :357-389
[2]  
[Anonymous], 2008, Introduction to information retrieval
[3]  
[Anonymous], 2005, Experiment and Evaluation in Information Retrieval
[4]  
Ballesteros L, 1997, PROCEEDINGS OF THE 20TH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, P84, DOI 10.1145/278459.258540
[5]  
BRASCHLER M, 2001, LECT NOTES COMPUTER, V3237, P7
[6]  
BRASCHLER M, 2004, LECT NOTES COMPUTER, V2337, P7
[7]  
Buckley C., 1996, P TREC 4 NIST GAITH, P25
[8]   Multilingual information retrieval using machine translation, relevance feedback and decompounding [J].
Chen, A ;
Gey, FC .
INFORMATION RETRIEVAL, 2004, 7 (1-2) :149-182
[9]   Web searching in a multilingual world [J].
Chung, Wingyan .
COMMUNICATIONS OF THE ACM, 2008, 51 (05) :32-40
[10]  
HARMAN DK, 2005, TREC EXPT EVALUATION, P79