Word sense disambiguation of WordNet glosses

Cited by: 20
Authors
Moldovan, D [1]
Novischi, A [1]
Affiliations
[1] Univ Texas, Dept Comp Sci, Richardson, TX 75083 USA
DOI
10.1016/j.csl.2004.05.007
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a suite of methods and results for the semantic disambiguation of WordNet glosses. WordNet is a resource widely used in natural language processing and artificial intelligence. Intended and designed as a lexical database, WordNet exhibits some deficiencies when used as a knowledge base. By semantically disambiguating the words in the glosses, we add pointers from each word to its concept or synset, and this increases the connectivity between the WordNet concepts by approximately an order of magnitude. We show how lexical chains and other applications can be built on this richly connected WordNet. The semantic disambiguation of the WordNet glosses is performed using automatic methods based on a set of heuristics. The precision of the semantic annotation is improved by using voting between the disambiguation system described here and another WSD system. The entire WordNet 2.0 has been disambiguated with an overall precision of 86% and is available at http://xwn.hlt.utdallas.edu. (C) 2004 Elsevier Ltd. All rights reserved.
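The abstract states that precision is improved by voting between two WSD systems but does not spell out the voting rule. The sketch below is one simple possibility, assumed here for illustration only: keep a sense label only where both systems propose the same sense. The function name `vote`, the `(gloss_id, token_index)` keys, and the sense labels are all hypothetical and are not taken from the paper or from the eXtended WordNet release.

```python
from typing import Dict, Tuple

# Hypothetical representation: (gloss_id, token_index) -> proposed sense label.
Annotation = Dict[Tuple[str, int], str]


def vote(system_a: Annotation, system_b: Annotation) -> Annotation:
    """Return only the annotations on which both systems agree.

    This is an assumed agreement rule, not the paper's documented method.
    Dropping disagreements trades coverage for precision.
    """
    agreed: Annotation = {}
    for key, sense in system_a.items():
        # Keep the label only if the second system proposed the same sense.
        if system_b.get(key) == sense:
            agreed[key] = sense
    return agreed


if __name__ == "__main__":
    # Made-up labels for two tokens of one gloss.
    a = {("gloss-1", 0): "sense-a", ("gloss-1", 1): "sense-b"}
    b = {("gloss-1", 0): "sense-a", ("gloss-1", 1): "sense-c"}
    print(vote(a, b))
```

Under this rule, tokens on which the systems disagree stay unannotated, so they would need a fallback method or manual review to be covered.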
Pages: 301-317
Page count: 17