Beyond kappa: A review of interrater agreement measures

Cited by: 626
Author
Banerjee, M
Affiliations
[1] Wayne State Univ, Sch Med, Ctr Healthcare Effectiveness Res, Detroit, MI 48201 USA
[2] Univ New Hampshire, Dept Math, Durham, NH 03824 USA
Source
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE | 1999 / Vol. 27 / No. 01
Keywords
kappa coefficient; intraclass correlation; log-linear models; nominal data; ordinal data;
DOI
10.2307/3315487
Chinese Library Classification number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Subject classification code
020208 ; 070103 ; 0714 ;
Abstract
In 1960, Cohen introduced the kappa coefficient to measure chance-corrected nominal scale agreement between two raters. Since then, numerous extensions and generalizations of this inter-rater agreement measure have been proposed in the literature. This paper reviews and critiques various approaches to the study of interrater agreement, for which the relevant data comprise either nominal or ordinal categorical ratings from multiple raters. It presents a comprehensive compilation of the main statistical approaches to this problem, descriptions and characterizations of the underlying models, and discussions of related statistical methodologies for estimation and confidence-interval construction. The emphasis is on various practical scenarios and designs that underlie the development of these measures, and the interrelationships between them.
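As a point of reference for the chance-corrected agreement mentioned in the abstract, the following is a minimal sketch of Cohen's kappa for two raters on a nominal scale, using the standard definition kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed proportion of agreement and p_e is the agreement expected by chance from the raters' marginal frequencies. The function name `cohen_kappa` and the sample ratings are illustrative assumptions, not taken from the paper, and none of the extensions the review surveys are covered.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two raters who each assign one nominal
    category to the same sequence of subjects (illustrative sketch)."""
    if len(ratings_a) != len(ratings_b) or len(ratings_a) == 0:
        raise ValueError("both raters must rate the same non-empty set of subjects")
    n = len(ratings_a)

    # Observed agreement: share of subjects given the same category by both raters.
    p_o = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Chance agreement: sum over categories of the product of the two
    # raters' marginal proportions (Counter returns 0 for unseen categories).
    count_a = Counter(ratings_a)
    count_b = Counter(ratings_b)
    categories = set(count_a) | set(count_b)
    p_e = sum(count_a[c] * count_b[c] for c in categories) / (n * n)

    if p_e == 1.0:
        # Both raters used one identical category throughout; kappa is undefined.
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (p_o - p_e) / (1.0 - p_e)


# Hypothetical ratings for four subjects.
rater_1 = ["yes", "yes", "no", "no"]
rater_2 = ["yes", "no", "no", "no"]
kappa = cohen_kappa(rater_1, rater_2)
```

In this hypothetical example the raters agree on 3 of 4 subjects (p_o = 0.75) and the marginals give p_e = 0.5, so kappa comes out at 0.5.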
Pages: 3-23
Page count: 21