A METHOD FOR DISAMBIGUATING WORD SENSES IN A LARGE CORPUS

被引:176
作者
GALE, WA [1 ]
CHURCH, KW [1 ]
YAROWSKY, D [1 ]
机构
[1] AT&T BELL LABS,MURRAY HILL,NJ 07974
来源
COMPUTERS AND THE HUMANITIES | 1992年 / 26卷 / 5-6期
关键词
CONTEXT; MEANING; SENSE; DISCRIMINATION; AMBIGUITY; POLYSEMY; BILINGUAL;
D O I
10.1007/BF00136984
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Word sense disambiguation has been recognized as a major problem in natural language processing research for over forty years. Both quantitive and qualitative methods have been tried, but much of this work has been stymied by difficulties in acquiring appropriate lexical resources. The availability of this testing and training material has enabled us to develop quantitative disambiguation methods that achieve 92% accuracy in discriminating between two very distinct senses of a noun. In the training phase, we collect a number of instances of each sense of the polysemous noun. Then in the testing phase, we are given a new instance of the noun, and are asked to assign the instance to one of the senses. We attempt to answer this question by comparing the context of the unknown instance with contexts of known instances using a Bayesian argument that has been applied successfully in related tasks such as author identification and information retrieval. The proposed method is probably most appropriate for those aspects of sense disambiguation that are closest to the information retrieval task. In particular, the proposed method was designed to disambiguate senses that are usually associated with different topics.
引用
收藏
页码:415 / 439
页数:25
相关论文
共 37 条
[1]  
BARHILLEL, 1960, ADV COMPUTERS
[3]  
BLACK E, 1987, THESIS CITY U NEW YO
[4]  
BROWN PF, 1991, 29TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS : PROCEEDINGS OF THE CONFERENCE, P169
[5]  
BROWN PF, 1991, 29TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS : PROCEEDINGS OF THE CONFERENCE, P264
[6]  
CHOUEKA Y, 1985, COMPUT HUMANITIES, V19, P149
[7]  
CHURCH K, 1989, P IEEE INT C ACOUSTI
[8]  
Cruse D.A., 1986, LEXICAL SEMANTICS
[9]  
DAGAN I, 1991, 29TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS : PROCEEDINGS OF THE CONFERENCE, P130
[10]  
FILLMORE C, 1991, 29TH ANN M ASS COMP