Word sense disambiguation using semantic relatedness measurement

被引:8
作者
Yang C.-Y. [1 ]
机构
[1] Department of Computer Science and Information Engineering, Tamkang University
来源
Journal of Zhejiang University-SCIENCE A | 2006年 / 7卷 / 10期
关键词
Natural language processing; Semantic relatedness; Word sense disambiguation (WSD); WordNet;
D O I
10.1631/jzus.2006.A1609
中图分类号
学科分类号
摘要
All human languages have words that can mean different things in different contexts, such words with multiple meanings are potentially 'ambiguous'. The process of 'deciding which of several meanings of a term is intended in a given context' is known as 'word sense disambiguation (WSD)'. This paper presents a method of WSD that assigns a target word the sense that is most related to the senses of its neighbor words. We explore the use of measures of relatedness between word senses based on a novel hybrid approach. First, we investigate how to 'literally' and 'regularly' express a 'concept'. We apply set algebra to WordNet's synsets cooperating with WordNet's word ontology. In this way we establish regular rules for constructing various representations (lexical notations) of a concept using Boolean operators and word forms in various synset(s) defined in WordNet. Then we establish a formal mechanism for quantifying and estimating the semantic relatedness between concepts-we facilitate 'concept distribution statistics' to determine the degree of semantic relatedness between two lexically expressed concepts. The experimental results showed good performance on Semcor, a subset of Brown corpus. We observe that measures of semantic relatedness are useful sources of information for WSD.
引用
收藏
页码:1609 / 1625
页数:16
相关论文
共 29 条
[11]  
Leacock C., Martin C., Combining local context with wordnet similarity for word sense identification, WordNet: A Lexical Reference System and Its Application, (1998)
[12]  
Lee J.H., Kim M.H., Lee Y.I., Information retrieval based on conceptual distance in IS-A hierarchies, Journal of Documentation, 49, 2, pp. 188-207, (1993)
[13]  
Lesk M., Automatic sense disambiguation using machine readable dictionaries: how to tell a pine code from an ice cream cone, Proceedings of the 5th Annual International Conference on Systems Documentation, pp. 24-26, (1986)
[14]  
Li H., Li C., Word translation disambiguation using bilingual bootstrapping, Computational Linguistics, 30, 1, pp. 1-22, (2004)
[15]  
Lin D., An information-theoretic definition of similarity, Proceedings of the International Conference on Machine Learning, (1998)
[16]  
Lin D., A case-base algorithm for word sense disambiguation, Proceedings of Conference Pacific Association for Computational Linguistics, (1999)
[17]  
Lin D., Word sense disambiguation with a similarity based smoothed library, Computers and the Humanities: Special Issue on SENSEVAL, 34, 1-2, pp. 147-152, (2000)
[18]  
Miller G.A., WordNet: A lexical database, Comm. ACM, 38, 11, pp. 39-41, (1995)
[19]  
Miller G.A., Beckwith R., Fellbaum C., Gross D., Miller K., Introduction to WordNet: An on-line lexical database, International Journal of Lexicography, 3, 4, pp. 235-312, (1990)
[20]  
Moldovan D., Mihalcea R., Using WordNet and lexical operators to improve Internet searches, IEEE Internet Computing, 4, 1, pp. 34-43, (2000)