Using Statistical Term Similarity for Sense Disambiguation in Cross-Language Information Retrieval

被引:11
作者
Mirna Adriani
机构
[1] University of Glasgow,Department of Computing Science
来源
Information Retrieval | 2000年 / 2卷 / 1期
关键词
cross-language information retrieval; term disambiguation;
D O I
10.1023/A:1009989801965
中图分类号
学科分类号
摘要
With the increasing availability of machine-readable bilingual dictionaries, dictionary-based automatic query translation has become a viable approach to Cross-Language Information Retrieval (CLIR). In this approach, resolving term ambiguity is a crucial step. We propose a sense disambiguation technique based on a term-similarity measure for selecting the right translation sense of a query term. In addition, we apply a query expansion technique which is also based on the term similarity measure to improve the effectiveness of the translation queries. The results of our Indonesian to English and English to Indonesian CLIR experiments demonstrate the effectiveness of the sense disambiguation technique. As for the query expansion technique, it is shown to be effective as long as the term ambiguity in the queries has been resolved. In the effort to solve the term ambiguity problem, we discovered that differences in the pattern of word-formation between the two languages render query translations from one language to the other difficult.
引用
收藏
页码:71 / 82
页数:11
相关论文
共 1 条
[1]  
Salton G(1970)Automatic processing of foreign language documents Journal of the American Society for Information Science 21 187-194