CONTEXTUAL WORD SIMILARITY AND ESTIMATION FROM SPARSE DATA

被引:41
作者
DAGAN, I [1 ]
MARCUS, S [1 ]
MARKOVITCH, S [1 ]
机构
[1] TECHNION ISRAEL INST TECHNOL,DEPT COMP SCI,IL-32000 HAIFA,ISRAEL
关键词
D O I
10.1006/csla.1995.0008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years there is much interest in word co-occurrence relations, such as n-grams, verb-object combinations, or co-occurrence within a limited context. This paper discusses how to estimate the likelihood of cc-occurrences that do not occur in the training data. We present a method that makes local analogies between each specific unobserved co-occurrence and other co-occurrences that contain similar words. These analogies are based on the assumption that similar word cooccurrences have similar values of mutual information. Accordingly, the word similarity metric captures similarities between vectors of mutual information values. Our evaluation suggests that this method performs better than existing, frequency-based, smoothing methods, and may provide an alternative to class-based models. A background survey is included, covering issues of lexical co-occurrence, data sparseness and smoothing, word similarity and clustering, and mutual information.
引用
收藏
页码:123 / 152
页数:30
相关论文
共 38 条
[1]  
AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
[2]  
ALLEN J, 1995, NATURAL LANGUAGE UND
[3]  
BRILL E, 1990, JUN DARPA SPEECH NAT
[4]  
BROWN PF, 1991, 29TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS : PROCEEDINGS OF THE CONFERENCE, P264
[5]  
Church K. W., 1990, Computational Linguistics, V16, P22
[6]  
Church K. W., 1991, Computer Speech and Language, V5, P19, DOI 10.1016/0885-2308(91)90016-J
[7]  
Church K.W., 1993, COMPUTATIONAL LINGUI, V19, P1
[8]  
Dagan I., 1991, Artificial Intelligence and Computer Vision. Proceedings of the Seventh Israeli Conference, P125
[9]  
DAGAN I, 1993, P 31 ANN M ASS COMP, P164
[10]  
DAGAN I, 1994, P 32 ANN M ASS COMP, P272