The disambiguation of nominalizations

Cited by: 31
Author
Lapata, M [1]
Affiliation
[1] Univ Edinburgh, Div Informat, Edinburgh EH8 9LW, Midlothian, Scotland
DOI
10.1162/089120102760276018
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This article addresses the interpretation of nominalizations, a particular class of compound nouns whose head noun is derived from a verb and whose modifier is interpreted as an argument of this verb. Any attempt to automatically interpret nominalizations needs to take into account: (a) the selectional constraints imposed by the nominalized compound head, (b) the fact that the relation of the modifier and the head noun can be ambiguous, and (c) the fact that these constraints can be easily overridden by contextual or pragmatic factors. The interpretation of nominalizations poses a further challenge for probabilistic approaches since the argument relations between a head and its modifier are not readily available in the corpus. Even an approximation that maps the compound head to its underlying verb provides insufficient evidence. We present an approach that treats the interpretation task as a disambiguation problem and show how we can "re-create" the missing distributional evidence by exploiting partial parsing, smoothing techniques, and contextual information. We combine these distinct information sources using Ripper, a system that learns sets of rules from data, and achieve an accuracy of 86.1% (over a baseline of 61.5%) on the British National Corpus.
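As a rough illustration of treating interpretation as a disambiguation problem, the sketch below picks the more probable argument relation (subject or object) between a nominalized head's underlying verb and its modifier. It is a minimal sketch under stated assumptions, not the article's method: the counts are invented toy data, the names `relation_probs` and `disambiguate` are hypothetical, and simple add-alpha smoothing stands in for the smoothing techniques the abstract mentions. The contextual features and the Ripper rule learner are not modelled here.

```python
from collections import Counter

# The two argument relations the modifier can bear to the underlying verb.
RELATIONS = ("subj", "obj")


def relation_probs(counts, verb, noun, alpha=1.0):
    """Add-alpha smoothed estimate of P(rel | verb, noun) over RELATIONS.

    `counts` maps (verb, relation, noun) tuples to corpus frequencies,
    e.g. as might be collected from a partially parsed corpus (assumption).
    """
    total = sum(counts[(verb, rel, noun)] for rel in RELATIONS)
    denom = total + alpha * len(RELATIONS)
    return {rel: (counts[(verb, rel, noun)] + alpha) / denom for rel in RELATIONS}


def disambiguate(counts, verb, noun, alpha=1.0):
    """Return the relation with the highest smoothed probability."""
    probs = relation_probs(counts, verb, noun, alpha)
    return max(RELATIONS, key=lambda rel: probs[rel])


# Toy counts, invented for illustration only (not taken from any corpus).
toy_counts = Counter({
    ("satisfy", "obj", "customer"): 8,
    ("satisfy", "subj", "customer"): 1,
    ("protest", "subj", "student"): 5,
})

# "customer satisfaction": head "satisfaction" maps to the verb "satisfy".
print(disambiguate(toy_counts, "satisfy", "customer"))
# "student protest": head "protest" maps to the verb "protest".
print(disambiguate(toy_counts, "protest", "student"))
```

With unseen verb-noun pairs the smoothed probabilities are equal, which is one reason a count-based estimate alone gives weak evidence and further information sources are needed.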
Pages: 357-388
Page count: 32