Beyond kappa: A review of interrater agreement measures

Cited by: 626
Author
Banerjee, M
Affiliations
[1] Wayne State Univ, Sch Med, Ctr Healthcare Effectiveness Res, Detroit, MI 48201 USA
[2] Univ New Hampshire, Dept Math, Durham, NH 03824 USA
Source
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE | 1999 / Vol. 27 / No. 01
Keywords
kappa coefficient; intraclass correlation; log-linear models; nominal data; ordinal data;
DOI
10.2307/3315487
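Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification code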
020208 ; 070103 ; 0714 ;
Abstract
In 1960, Cohen introduced the kappa coefficient to measure chance-corrected nominal scale agreement between two raters. Since then, numerous extensions and generalizations of this inter-rater agreement measure have been proposed in the literature. This paper reviews and critiques various approaches to the study of interrater agreement, for which the relevant data comprise either nominal or ordinal categorical ratings from multiple raters. It presents a comprehensive compilation of the main statistical approaches to this problem, descriptions and characterizations of the underlying models, and discussions of related statistical methodologies for estimation and confidence-interval construction. The emphasis is on various practical scenarios and designs that underlie the development of these measures, and the interrelationships between them.
Pages: 3-23
Page count: 21
Related papers
63 in total
[1]  
Agresti A, 1992, Stat Methods Med Res, V1, P201, DOI 10.1177/096228029200100205
[2]   QUASI-SYMMETRICAL LATENT CLASS MODELS, WITH APPLICATION TO RATER AGREEMENT [J].
AGRESTI, A ;
LANG, JB .
BIOMETRICS, 1993, 49 (01) :131-139
[3]   A MODEL FOR AGREEMENT BETWEEN RATINGS ON AN ORDINAL SCALE [J].
AGRESTI, A .
BIOMETRICS, 1988, 44 (02) :539-548
[4]   MAXIMUM-LIKELIHOOD-ESTIMATION OF AGREEMENT IN THE CONSTANT PREDICTIVE PROBABILITY MODEL, AND ITS RELATION TO COHEN KAPPA [J].
AICKIN, M .
BIOMETRICS, 1990, 46 (02) :293-302
[5]  
[Anonymous], 1988, ANN STAT
[6]   A COMPARISON OF METHODS FOR CALCULATING A STRATIFIED-KAPPA [J].
BARLOW, W ;
LAI, MY ;
AZEN, SP .
STATISTICS IN MEDICINE, 1991, 10 (09) :1465-1472
[7]   Measurement of interrater agreement with adjustment for covariates [J].
Barlow, W .
BIOMETRICS, 1996, 52 (02) :695-702
[8]   LATENT VARIABLE MODELS FOR ORDERED CATEGORICAL-DATA [J].
BARTHOLOMEW, DJ .
JOURNAL OF ECONOMETRICS, 1983, 22 (1-2) :229-243
[9]   2X2 KAPPA-COEFFICIENTS - MEASURES OF AGREEMENT OR ASSOCIATION [J].
BLOCH, DA ;
KRAEMER, HC .
BIOMETRICS, 1989, 45 (01) :269-287
[10]   BIAS, PREVALENCE AND KAPPA [J].
BYRT, T ;
BISHOP, J ;
CARLIN, JB .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1993, 46 (05) :423-429