Nominalization and Alternations in Biomedical Language

被引:32
作者
Cohen, K. Bretonnel [1 ,2 ]
Palmer, Martha [2 ]
Hunter, Lawrence [1 ]
机构
[1] Univ Colorado, Sch Med, Ctr Computat Pharmacol, Aurora, CO USA
[2] Univ Colorado, Dept Linguist, Boulder, CO USA
来源
PLOS ONE | 2008年 / 3卷 / 09期
关键词
D O I
10.1371/journal.pone.0003158
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: This paper presents data on alternations in the argument structure of common domain-specific verbs and their associated verbal nominalizations in the PennBioIE corpus. Alternation is the term in theoretical linguistics for variations in the surface syntactic form of verbs, e. g. the different forms of stimulate in FSH stimulates follicular development and follicular development is stimulated by FSH. The data is used to assess the implications of alternations for biomedical text mining systems and to test the fit of the sublanguage model to biomedical texts. Methodology/Principal Findings: We examined 1,872 tokens of the ten most common domain-specific verbs or their zero-related nouns in the PennBioIE corpus and labelled them for the presence or absence of three alternations. We then annotated the arguments of 746 tokens of the nominalizations related to these verbs and counted alternations related to the presence or absence of arguments and to the syntactic position of non-absent arguments. We found that alternations are quite common both for verbs and for nominalizations. We also found a previously undescribed alternation involving an adjectival present participle. Conclusions/Significance: We found that even in this semantically restricted domain, alternations are quite common, and alternations involving nominalizations are exceptionally diverse. Nonetheless, the sublanguage model applies to biomedical language. We also report on a previously undescribed alternation involving an adjectival present participle.
引用
收藏
页数:21
相关论文
共 60 条
[1]  
ADAM M, 2004, P 2 ACL WORKSH MULT, P96
[2]  
ADAM M, ANNOTATION GUI UNPUB
[3]  
ADAM M, 2004, P LREC 2004, P803
[4]  
ADAM M, 2004, NOMBANK PROJECT INTE, P24
[5]  
[Anonymous], 2006, P WORKSHOP FRONTIERS
[6]  
[Anonymous], ENGLISH VERB CLASSES
[7]  
[Anonymous], 1999, LONGMAN GRAMMAR SPOK
[8]  
[Anonymous], 1985, COMPREHENSIVE GRAMMA, DOI DOI 10.1177/007542428702000108
[9]  
BARBARA HP, 1993, MATH METHODS LINGUIS
[10]  
BENGOERTZEL, 2006, P BIONLP 06 WORKSH L, P104