Computing inter-rater reliability and its variance in the presence of high agreement

Cited by: 1200
Author
Gwet, Kilem Li [1]
Affiliation
[1] STATAXIS Consulting, Gaithersburg, MD 20886 USA
DOI
10.1348/000711006X126600
Chinese Library Classification (CLC) number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
Pi (π) and kappa (κ) statistics are widely used in the areas of psychiatry and psychological testing to compute the extent of agreement between raters on nominally scaled data. It is a fact that these coefficients occasionally yield unexpected results in situations known as the paradoxes of kappa. This paper explores the origin of these limitations, and introduces an alternative and more stable agreement coefficient referred to as the AC1 coefficient. Also proposed are new variance estimators for the multiple-rater generalized π and AC1 statistics, whose validity does not depend upon the hypothesis of independence between raters. This is an improvement over existing alternative variances, which depend on the independence assumption. A Monte-Carlo simulation study demonstrates the validity of these variance estimators for confidence interval construction, and confirms the value of AC1 as an improved alternative to existing inter-rater reliability statistics.
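The abstract does not give formulas, so the following is only a minimal sketch of the paradox it describes. It assumes the two-rater definitions as they are commonly stated: every coefficient has the form (pa - pe) / (1 - pe) and differs only in the chance-agreement term pe. The function name `agreement_coefficients` and the 2x2 table of counts are hypothetical and are not taken from the paper. The variance estimators mentioned in the abstract are not reproduced here.

```python
from typing import Dict, Sequence


def agreement_coefficients(table: Sequence[Sequence[float]]) -> Dict[str, float]:
    """Two-rater agreement coefficients from a square table of counts.

    table[k][l] = number of subjects put in category k by rater A
    and in category l by rater B (hypothetical layout, assumed here).
    """
    q = len(table)                      # number of categories
    n = sum(sum(row) for row in table)  # number of subjects
    p = [[count / n for count in row] for row in table]

    # Observed agreement: proportion of subjects on the diagonal.
    pa = sum(p[k][k] for k in range(q))

    # Marginal classification proportions for each rater.
    row = [sum(p[k]) for k in range(q)]
    col = [sum(p[i][k] for i in range(q)) for k in range(q)]
    # Average of the two raters' marginals for each category.
    pi_k = [(row[k] + col[k]) / 2 for k in range(q)]

    # Chance-agreement terms (assumed standard definitions).
    pe_kappa = sum(row[k] * col[k] for k in range(q))        # Cohen's kappa
    pe_pi = sum(x * x for x in pi_k)                         # Scott's pi
    pe_ac1 = sum(x * (1 - x) for x in pi_k) / (q - 1)        # AC1

    def chance_corrected(pe: float) -> float:
        return (pa - pe) / (1 - pe)

    return {
        "observed_agreement": pa,
        "kappa": chance_corrected(pe_kappa),
        "pi": chance_corrected(pe_pi),
        "ac1": chance_corrected(pe_ac1),
    }


# Hypothetical data: 125 subjects, almost all placed in category 0 by both raters.
example = [[118, 5],
           [2, 0]]
result = agreement_coefficients(example)
for name, value in result.items():
    print(f"{name}: {value:.4f}")
```

With this table the raters agree on more than 94% of subjects, yet kappa and pi come out slightly negative because their chance-agreement terms are almost as large as the observed agreement. Under the assumed definition, the AC1 chance term stays small when one category dominates, so AC1 remains close to the observed agreement.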
Pages: 29-48
Page count: 20
Related papers
18 in total
[1] [Anonymous], 2011, Categorical data analysis
[2] Beyond kappa: A review of interrater agreement measures [J]. Banerjee, M. Canadian Journal of Statistics-Revue Canadienne de Statistique, 1999, 27(01): 3-23
[3] Bishop M.M., 1975, DISCRETE MULTIVARIAT
[4] High agreement but low kappa. 2. Resolving the paradoxes [J]. Cicchetti, DV; Feinstein, AR. Journal of Clinical Epidemiology, 1990, 43(06): 551-558
[5] Cochran WG., 1963, SAMPLING TECHNIQUES
[7] A coefficient of agreement for nominal scales [J]. Cohen, J. Educational and Psychological Measurement, 1960, 20(01): 37-46
[8] Integration and generalization of kappas for multiple raters [J]. Conger, AJ. Psychological Bulletin, 1980, 88(02): 322-328
[9] Large sample standard errors of kappa and weighted kappa [J]. Fleiss, JL; Cohen, J; Everitt, BS. Psychological Bulletin, 1969, 72(05): 323-&
[10] FLEISS JL, 1971, PSYCHOL BULL, V76, P378, DOI 10.1037/h0031619