Sample-size calculations for Cohen's kappa

被引:162
作者
Cantor, AB
机构
[1] H. Lee Moffitt Cancer Center, Research Institute, Tampa, FL 36122-9497
关键词
D O I
10.1037/1082-989X.1.2.150
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
In recent years, researchers in the psychosocial and biomedical sciences have become increasingly aware of the importance of sample-size calculations in the design of research projects. Such considerations are, however, rarely applied for studies involving agreement of raters. Published results on this topic are limited and generally provide rather complex formulas. In addition, they generally make the assumption that the raters have the same set of frequencies for the possible ratings. In this article I show that for the case of 2 raters and 2 possible ratings the assumptions of equal frequencies can be dropped. Tables that allow for almost immediate sample-size determination for a variety of common study designs are given.
引用
收藏
页码:150 / 153
页数:4
相关论文
共 11 条
[1]  
AKERN RP, 1995, BRIT J UROL, V75, P5
[3]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46
[4]  
Cohen J., 1988, Statistical Power Analysis for the Behavioral Sciences, V2
[5]   A GOODNESS-OF-FIT APPROACH TO INFERENCE PROCEDURES FOR THE KAPPA-STATISTIC - CONFIDENCE-INTERVAL CONSTRUCTION, SIGNIFICANCE-TESTING AND SAMPLE-SIZE ESTIMATION [J].
DONNER, A ;
ELIASZIW, M .
STATISTICS IN MEDICINE, 1992, 11 (11) :1511-1519
[6]   SAMPLE-SIZE DETERMINATIONS FOR THE 2 RATER-KAPPA STATISTIC [J].
FLACK, VF ;
AFIFI, AA ;
LACHENBRUCH, PA ;
SCHOUTEN, HJA .
PSYCHOMETRIKA, 1988, 53 (03) :321-325
[7]   LARGE SAMPLE STANDARD ERRORS OF KAPPA AND WEIGHTED KAPPA [J].
FLEISS, JL ;
COHEN, J ;
EVERITT, BS .
PSYCHOLOGICAL BULLETIN, 1969, 72 (05) :323-&
[8]  
FLEISS JL, 1971, PSYCHOL BULL, V76, P378, DOI 10.1037/h0031619
[9]   INTRODUCTION TO SAMPLE-SIZE DETERMINATION AND POWER ANALYSIS FOR CLINICAL-TRIALS [J].
LACHIN, JM .
CONTROLLED CLINICAL TRIALS, 1981, 2 (02) :93-113
[10]   MEASURING INTERRATER RELIABILITY AMONG MULTIPLE RATERS - AN EXAMPLE OF METHODS FOR NOMINAL DATA [J].
POSNER, KL ;
SAMPSON, PD ;
CAPLAN, RA ;
WARD, RJ ;
CHENEY, FW .
STATISTICS IN MEDICINE, 1990, 9 (09) :1103-1115