HOW RELIABLE ARE CHANCE-CORRECTED MEASURES OF AGREEMENT

被引:55
作者
GUGGENMOOSHOLZMANN, I
机构
[1] Department of Biostatistics and Medical Informatics, Free University of Berlin, Berlin, 1000
关键词
D O I
10.1002/sim.4780122305
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Chance-corrected measures of agreement are prone to exhibit paradoxical and counter-intuitive results when used as measures of reliability. It is demonstrated that these problems arise with Cohen's kappa as well as with Aickin's alpha. They are the consequence of an analogy to Simpson's paradox in mixed populations. It is further shown that chance-corrected measures of agreement may yield misleading values for binary ratings. It is concluded that improvements in the design and the analysis of reliability studies are a pre-requisite for valid and pertinent results.
引用
收藏
页码:2191 / 2205
页数:15
相关论文
共 33 条
[1]   A MODEL FOR AGREEMENT BETWEEN RATINGS ON AN ORDINAL SCALE [J].
AGRESTI, A .
BIOMETRICS, 1988, 44 (02) :539-548
[2]   MAXIMUM-LIKELIHOOD-ESTIMATION OF AGREEMENT IN THE CONSTANT PREDICTIVE PROBABILITY MODEL, AND ITS RELATION TO COHEN KAPPA [J].
AICKIN, M .
BIOMETRICS, 1990, 46 (02) :293-302
[3]  
BAKER RJ, 1978, GLIM SYSTEM
[4]   USING REPLICATE OBSERVATIONS IN OBSERVER AGREEMENT STUDIES WITH BINARY ASSESSMENTS [J].
BAKER, SG ;
FREEDMAN, LS ;
PARMAR, MKB .
BIOMETRICS, 1991, 47 (04) :1327-1338
[5]  
BEGG CB, 1986, J CHRON DIS, V39, P575
[6]  
Bishop Y.M., 1977, DISCRETE MULTIVARIAT
[7]   2X2 KAPPA-COEFFICIENTS - MEASURES OF AGREEMENT OR ASSOCIATION [J].
BLOCH, DA ;
KRAEMER, HC .
BIOMETRICS, 1989, 45 (01) :269-287
[8]   COEFFICIENT KAPPA - SOME USES, MISUSES, AND ALTERNATIVES [J].
BRENNAN, RL ;
PREDIGER, DJ .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1981, 41 (03) :687-699
[9]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46
[10]  
Dawid A.P., 1979, J R STAT SOC C-APPL, V28, P20, DOI DOI 10.2307/2346806