Evaluating WordNet-based measures of lexical semantic relatedness

被引:62
作者
Budanitsky, Alexander [1 ]
Hirst, Graeme [1 ]
机构
[1] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3G4, Canada
关键词
D O I
10.1162/089120106776173093
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The quantification of lexical semantic relatedness has many applications in NLP, and many different measures have been proposed. We evaluate five of these measures, all of which use WordNet as their central resource, by comparing their performance in detecting and correcting real-word spelling errors. An information-content-based measure proposed by Jiang and Conrath is found superior to those proposed by Hirst and St-Onge, Leacock and Chodorow, Lin, and Resnik. In addition, we explain why distributional similarity is not an adequate proxy for lexical semantic relatedness.
引用
收藏
页码:13 / 47
页数:35
相关论文
共 58 条
[1]  
Agresti A., 1997, STAT METHODS SOCIAL, VThird
[2]  
[Anonymous], 1993, P 6 C EUROPEAN CHAPT
[3]  
[Anonymous], 1997, PROC 10 RES COMPUTAT
[4]  
Banerjee S., 2003, P 18 INT JOINT C ART, V3, P805
[5]   INTRACONCEPT SIMILARITY AND ITS IMPLICATIONS FOR INTERCONCEPT SIMILARITY [J].
BARSALOU, LW .
SIMILARITY AND ANALOGICAL REASONING, 1989, :76-121
[6]   AD HOC CATEGORIES [J].
BARSALOU, LW .
MEMORY & COGNITION, 1983, 11 (03) :211-227
[7]  
Bernard J, 1986, MACQUARIE THESAURUS
[8]  
BUDANITSKY A, 1999, CSRG390 U TOR
[9]  
Budanitsky A., 2001, Workshop on WordNet and Other Lexical Resources, P29
[10]  
CORLEY C, 2005, P ACL WORKSH EMP MOD, P13