Experiments on the automatic induction of German semantic verb classes

被引:63
作者
Walde, Sabine Schulte im [1 ]
机构
[1] Univ Saarland, D-6600 Saarbrucken, Germany
关键词
D O I
10.1162/coli.2006.32.2.159
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents clustering experiments on German verbs: A statistical grammar model for German serves as the source for a distributional verb description at the lexical syntax-semantics interface, and the unsupervised clustering algorithm k-means uses the empirical verb properties to perform an automatic induction of verb classes. Various evaluation measures are applied to compare the clustering results to gold standard German semantic verb classes under different criteria. The primary goals of the experiments are (1) to empirically utilize and investigate the well-established relationship between verb meaning and verb behavior within a cluster analysis and (2) to investigate the required technical parameters of a cluster analysis with respect to this specific linguistic task. The clustering methodology is developed on a small-scale verb set and then applied to a larger-scale verb set including 883 German verbs.
引用
收藏
页码:159 / 194
页数:36
相关论文
共 54 条
[1]  
[Anonymous], 2000, Proceedings of the 18th conference on Computational linguistics
[2]  
[Anonymous], 1997, P ACL WORKSHOP AUTOM
[3]  
[Anonymous], 1993, P 31 ANN M ASS COMP
[4]  
[Anonymous], ARTIF INTELL STAT
[5]  
Baker C.F., 1998, P 36 ANN M ASS COMP, P86, DOI DOI 10.3115/980845.980860
[6]  
Baum L.E., 1972, Inequalities III: Proceedings of the Third Symposium on Inequalities, page, V3, P1
[7]  
CARROLL G, 1998, P 3 C EMP METH NAT
[8]  
CHARNIAK E, 1997, P 14 NAT C ART INT
[9]  
CHEN S, 1988, TR1098 HARV U CTR RE
[10]  
Cover TM, 2006, Elements of Information Theory