Semantic similarity in a taxonomy: An information-based measure and its application to problems of ambiguity in natural language

被引:1106
作者
Resnik, P [1 ]
机构
[1] Univ Maryland, Dept Linguist, College Pk, MD 20742 USA
[2] Univ Maryland, Inst Adv Comp Studies, College Pk, MD 20742 USA
关键词
D O I
10.1613/jair.514
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents a measure of semantic similarity in an is-a taxonomy based on the notion of shared information content. Experimental evaluation against a benchmark set of human similarity judgments demonstrates that the measure performs better than the traditional edge-counting approach. The article presents algorithms that take advantage of taxonomic similarity in resolving syntactic and semantic ambiguity, along with experimental results demonstrating their effectiveness.
引用
收藏
页码:95 / 130
页数:36
相关论文
共 72 条
[31]  
LAUER M, 1995, THESIS MACQUARIE U S
[32]  
LAUER M, 1994, P 32 ANN M ASS COMP
[33]  
LEACOCK C, 1994, UNPUB FILLING SPARSE
[34]   INFORMATION-RETRIEVAL BASED ON CONCEPTUAL DISTANCE IN IS-A HIERARCHIES [J].
LEE, JH ;
KIM, MH ;
LEE, YJ .
JOURNAL OF DOCUMENTATION, 1993, 49 (02) :188-207
[35]  
Lee LJ, COMPUT THERM SCI
[36]  
Lesk Michael, 1986, Proceedings of the 1986 SIGDOC Conference, P24, DOI DOI 10.1145/318723.318728
[37]  
LI H, 1995, P INT C REC ADV NLP
[38]  
Lin D., 1998, P 15 INT C MACH LEAR, P296
[39]  
Lin D, 1997, P 35 ANN M ASS COMP, P64
[40]  
Marcus Mitchell P., 1993, Computational Linguistics, V19, P313, DOI 10.21236/ADA273556