Measuring interrater agreement for ratings of a single target

被引:66
作者
Lindell, MK
Brandt, CJ
机构
[1] GEORGE WASHINGTON UNIV,WASHINGTON,DC 20052
[2] MICHIGAN STATE UNIV,E LANSING,MI 48824
关键词
aggregation; cross-level analysis; interrater agreement;
D O I
10.1177/01466216970213006
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Researchers assessing interrater agreement for ratings of a single target have increasingly used the r(WG(j)) index, but have found it can display irregular behavior. Mathematical analyses show this problem arises from the use of random response, operationalized by the variance of a uniform distribution (s(EU)(2)), for the baseline of comparison. These analyses suggest that researchers should continue to use r(WG)(j) as a summary measure of interrater agreement, but should use maximum dissensus as a reference distribution for computing r(WG)(j). Although values of s(EU)(2) can be descriptively misleading, they provide an important inferential baseline. Thus, s(EU)(2) should be used in computing chi(2) tests Of the departure of the observed response variance from random responding. Researchers should also examine interrater agreement as a theoretical variable in its own right, investigating the causes and consequences of rater dissensus.
引用
收藏
页码:271 / 278
页数:8
相关论文
共 10 条
[1]   DEVIL RIDES AGAIN - CORRELATION AS AN INDEX OF FIT [J].
BIRNBAUM, MH .
PSYCHOLOGICAL BULLETIN, 1973, 79 (04) :239-242
[2]   THE NATURE OF THE DATA, OR HOW TO CHOOSE A CORRELATION-COEFFICIENT [J].
CARROLL, JB .
PSYCHOMETRIKA, 1961, 26 (04) :347-372
[3]   A NOTE ON ESTIMATING RELIABILITY OF CATEGORICAL DATA [J].
FINN, RH .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1970, 30 (01) :71-&
[4]   ORGANIZATIONAL-STRUCTURE - REVIEW OF STRUCTURAL DIMENSIONS AND THEIR CONCEPTUAL RELATIONSHIPS WITH INDIVIDUAL ATTITUDES AND BEHAVIOR [J].
JAMES, LR ;
JONES, AP .
ORGANIZATIONAL BEHAVIOR AND HUMAN PERFORMANCE, 1976, 16 (01) :74-113
[5]   R(WG) - AN ASSESSMENT OF WITHIN-GROUP INTERRATER AGREEMENT [J].
JAMES, LR ;
DEMAREE, RG ;
WOLF, G .
JOURNAL OF APPLIED PSYCHOLOGY, 1993, 78 (02) :306-309
[6]   ESTIMATING WITHIN-GROUP INTERRATER RELIABILITY WITH AND WITHOUT RESPONSE BIAS [J].
JAMES, LR ;
DEMAREE, RG ;
WOLF, G .
JOURNAL OF APPLIED PSYCHOLOGY, 1984, 69 (01) :85-98
[7]   A DISAGREEMENT ABOUT WITHIN-GROUP AGREEMENT - DISENTANGLING ISSUES OF CONSISTENCY VERSUS CONSENSUS [J].
KOZLOWSKI, SWJ ;
HATTRUP, K .
JOURNAL OF APPLIED PSYCHOLOGY, 1992, 77 (02) :161-167
[8]   INTERPRETATION OF R2 INDEX IN REGRESSION-MODELS OF JUDGMENT [J].
LINDELL, MK .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1978, 38 (01) :69-74
[9]   INTERRATER RELIABILITY COEFFICIENTS CANNOT BE COMPUTED WHEN ONLY ONE STIMULUS IS RATED [J].
SCHMIDT, FL ;
HUNTER, JE .
JOURNAL OF APPLIED PSYCHOLOGY, 1989, 74 (02) :368-370
[10]   INTERRATER RELIABILITY AND AGREEMENT OF SUBJECTIVE JUDGMENTS [J].
TINSLEY, HEA ;
WEISS, DJ .
JOURNAL OF COUNSELING PSYCHOLOGY, 1975, 22 (04) :358-376