Word sense disambiguation using semantic relatedness measurement

被引:8
作者
Yang C.-Y. [1 ]
机构
[1] Department of Computer Science and Information Engineering, Tamkang University
来源
Journal of Zhejiang University-SCIENCE A | 2006年 / 7卷 / 10期
关键词
Natural language processing; Semantic relatedness; Word sense disambiguation (WSD); WordNet;
D O I
10.1631/jzus.2006.A1609
中图分类号
学科分类号
摘要
All human languages have words that can mean different things in different contexts, such words with multiple meanings are potentially 'ambiguous'. The process of 'deciding which of several meanings of a term is intended in a given context' is known as 'word sense disambiguation (WSD)'. This paper presents a method of WSD that assigns a target word the sense that is most related to the senses of its neighbor words. We explore the use of measures of relatedness between word senses based on a novel hybrid approach. First, we investigate how to 'literally' and 'regularly' express a 'concept'. We apply set algebra to WordNet's synsets cooperating with WordNet's word ontology. In this way we establish regular rules for constructing various representations (lexical notations) of a concept using Boolean operators and word forms in various synset(s) defined in WordNet. Then we establish a formal mechanism for quantifying and estimating the semantic relatedness between concepts-we facilitate 'concept distribution statistics' to determine the degree of semantic relatedness between two lexically expressed concepts. The experimental results showed good performance on Semcor, a subset of Brown corpus. We observe that measures of semantic relatedness are useful sources of information for WSD.
引用
收藏
页码:1609 / 1625
页数:16
相关论文
共 29 条
[1]  
Agirre E., Rigau G., Word sense disambiguation using conceptual density, Proceedings of the 16th International Conference on Computational Linguistics (Coling'96), pp. 16-22, (1996)
[2]  
Banerjee S., Pedersen T., An adapted lesk algorithm for word sense disambiguation using wordnet, Proceedings of the Third International Conference on Intelligent Text Processing and Computational Linguistics, pp. 136-145, (2002)
[3]  
Bruce B., Wiebe J., A new approach to sense identication, ARPA Workshop on Human Language Technology, (1994)
[4]  
Chua S., Kulathuramaiyer N., Semantic feature selection using WordNet, Proceedings of IEEE/WIC/ACM International Conference on Web Intelligence (WI 2004), pp. 166-172, (2004)
[5]  
Fellbaum C., An Electronic Lexical Database, (1998)
[6]  
Hirst G., St-Onge D., Lexical chains as representations of context for the detection and correction of malapropisms, WordNet: An Electronic Lexical Database, (1998)
[7]  
Jiang J.J., Conrath D.W., Semantic similarity based on corpus statistics and lexical taxonomy, Proceedings of ROCLING X (1997) International Conference on Research in Computational Linguistics, (1997)
[8]  
Kilgarriff A., I don't believe in word senses, Computers and the Humanities, 31, 2, pp. 91-113, (1997)
[9]  
Kilgarriff A., Rosenzweig J., Framework and results for English SENSEVAL, Computers and the Humanities: Special Issue on SENSEVAL, 34, 1-2, pp. 15-48, (2000)
[10]  
Kim S.B., Seo H.C., Rim H.C., Information retrieval using word senses: Root sense tagging approach, Proceedings of the 27th Annual International Conference on Research and Development in Information Retrieval (SIGIR'04), pp. 258-265, (2004)