Interrater agreement reconsidered:: An alternative to the rwg indices

被引:216
作者
Brown, RD [1 ]
Hauenstein, NMA
机构
[1] Western Kentucky Univ, Dept Psychol, Bowling Green, KY 42101 USA
[2] Virginia Polytech Inst & State Univ, Blacksburg, VA 24061 USA
关键词
interrater agreement; interrater reliability; kappa;
D O I
10.1177/1094428105275376
中图分类号
B849 [应用心理学];
学科分类号
040203 ;
摘要
For continuous constructs, the most frequently used index of interrater agreement (r(wg(1))) can be problematic. Typically, r(wg(1)) is estimated with the assumption that a uniform distribution represents no agreement. The authors review the limitations of this uniform null r(wg(1)) index and discuss alternative methods for measuring interrater agreement. A new interrater agreement statistic, a(wg(1)), is proposed. The authors derive the a(wg(1)) statistic and demonstrate that a(wg(1)) is an analogue to Cohen's kappa, an interrater agreement index for nominal data. A comparison is made between agreement estimates based on the uniform r(wg(1)) and a(wg(1)), and issues such as minimum sample size andpractical significance levels are discussed. The authors close with recommendations regarding the use of r(wg(1))/r(wg(J)) when a uniform null is assumed, indices that do not assume a uniform null, a(wg(1))/a(wg(J)) indices, and generalizability estimates of interrater agreement.
引用
收藏
页码:165 / 184
页数:20
相关论文
共 52 条
[11]   Accurate tests of statistical significance for rWG and average deviation interrater agreement indexes [J].
Dunlap, WP ;
Burke, MJ ;
Smith-Crowe, K .
JOURNAL OF APPLIED PSYCHOLOGY, 2003, 88 (02) :356-362
[12]  
Eby LT, 1997, J ORGAN BEHAV, V18, P275, DOI 10.1002/(SICI)1099-1379(199705)18:3<275::AID-JOB796>3.0.CO
[13]  
2-C
[14]   BEYOND ATTRIBUTION THEORY - COGNITIVE-PROCESSES IN PERFORMANCE-APPRAISAL [J].
FELDMAN, JM .
JOURNAL OF APPLIED PSYCHOLOGY, 1981, 66 (02) :127-148
[15]   A NOTE ON ESTIMATING RELIABILITY OF CATEGORICAL DATA [J].
FINN, RH .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1970, 30 (01) :71-&
[16]   Interrater reliability and agreement of performance ratings: A methodological comparison [J].
Fleenor, JW ;
Fleenor, JB ;
Grossnickle, WF .
JOURNAL OF BUSINESS AND PSYCHOLOGY, 1996, 10 (03) :367-380
[17]   PERSONALITY, AFFECT, AND BEHAVIOR IN GROUPS [J].
GEORGE, JM .
JOURNAL OF APPLIED PSYCHOLOGY, 1990, 75 (02) :107-116
[18]   RATING ABILITY IN PERFORMANCE JUDGMENTS - THE JOINT INFLUENCE OF IMPLICIT THEORIES AND INTELLIGENCE [J].
HAUENSTEIN, NMA ;
ALEXANDER, RA .
ORGANIZATIONAL BEHAVIOR AND HUMAN DECISION PROCESSES, 1991, 50 (02) :300-323
[19]   Rater bias in psychological research: When is it a problem and what can we do about it? [J].
Hoyt, WT .
PSYCHOLOGICAL METHODS, 2000, 5 (01) :64-86
[20]   An examination of the relationship between work group characteristics and performance: Once more into the breech [J].
Hyatt, DE ;
Ruddy, TM .
PERSONNEL PSYCHOLOGY, 1997, 50 (03) :553-585