Inter-Coder Agreement for Computational Linguistics

Cited by: 792
Authors
Artstein, Ron
Poesio, Massimo [1, 2]
Affiliations
[1] Univ Essex, Dept Comp & Elect Syst, Colchester CO4 3SQ, Essex, England
[2] Univ Trento, CIMeC, I-38068 Rovereto, TN, Italy
DOI
10.1162/coli.07-034-R2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This article is a survey of methods for measuring agreement among corpus annotators. It exposes the mathematics and underlying assumptions of agreement coefficients, covering Krippendorff's alpha as well as Scott's pi and Cohen's kappa; discusses the use of coefficients in several annotation tasks; and argues that weighted, alpha-like coefficients, traditionally less used than kappa-like measures in computational linguistics, may be more appropriate for many corpus annotation tasks, but that their use makes the interpretation of the value of the coefficient even harder.
Pages: 555-596
Page count: 42