Accurate tests of statistical significance for rWG and average deviation interrater agreement indexes

被引:162
作者
Dunlap, WP
Burke, MJ [1 ]
Smith-Crowe, K
机构
[1] Tulane Univ, AB Freeman Sch Business, New Orleans, LA 70118 USA
[2] Tulane Univ, Dept Psychol, New Orleans, LA 70118 USA
关键词
D O I
10.1037/0021-9010.88.2.356
中图分类号
B849 [应用心理学];
学科分类号
040203 ;
摘要
The authors demonstrated that the most common statistical significance test used with r(WG)-type interrater agreement indexes in applied psychology, based on the chi-square distribution, is flawed and inaccurate. The chi-square test is shown to be extremely conservative even for modest, standard significance levels (e.g., .05). The authors present an alternative statistical significance test, based on. Monte Carlo procedures, that produces the equivalent of an approximate randomization test for the null hypothesis that the actual distribution of responding is rectangular and demonstrate its superiority to the chi-square test. Finally, the authors provide tables of critical values and offer downloadable software to implement the approximate randomization test,for r(WG)-type and for average deviation (AD)-type interrater agreement indexes. The implications of these results for studying a broad range of interrater agreement problems in applied psychology are discussed.
引用
收藏
页码:356 / 362
页数:7
相关论文
共 32 条
[1]   On Average Deviation Indices for Estimating Interrater Agreement [J].
Burke, Michael J. ;
Finkelstein, Lisa M. ;
Dusig, Michelle S. .
ORGANIZATIONAL RESEARCH METHODS, 1999, 2 (01) :49-68
[2]   Estimating interrater agreement with the average deviation index: A user's guide [J].
Burke, MJ ;
Dunlap, WP .
ORGANIZATIONAL RESEARCH METHODS, 2002, 5 (02) :159-172
[3]   Do situational variables act as substantive causes of relationships between individual difference variables? Two large-scale tests of ''common cause'' models [J].
Burke, MJ ;
Rupinski, MT ;
Dunlap, WP ;
Davison, HK .
PERSONNEL PSYCHOLOGY, 1996, 49 (03) :573-598
[4]   Organizational efforts to affirm sexual diversity: A cross-level examination [J].
Button, SB .
JOURNAL OF APPLIED PSYCHOLOGY, 2001, 86 (01) :17-28
[5]   Functional relations among constructs in the same content domain at different levels of analysis: A typology of composition models [J].
Chan, D .
JOURNAL OF APPLIED PSYCHOLOGY, 1998, 83 (02) :234-246
[6]   The influence of demographic heterogeneity on the emergence and consequences of cooperative norms in work teams [J].
Chatman, JA ;
Flynn, FJ .
ACADEMY OF MANAGEMENT JOURNAL, 2001, 44 (05) :956-974
[7]   Statistical properties of the rWG(J) index of agreement [J].
Cohen, A ;
Doveh, E ;
Eick, U .
PSYCHOLOGICAL METHODS, 2001, 6 (03) :297-310
[8]   The job demands-resources model of burnout [J].
Demerouti, E ;
Bakker, AB ;
Nachreiner, F ;
Schaufeli, WB .
JOURNAL OF APPLIED PSYCHOLOGY, 2001, 86 (03) :499-512
[9]   Trust in leadership and team performance: Evidence from NCAA basketball [J].
Dirks, KT .
JOURNAL OF APPLIED PSYCHOLOGY, 2000, 85 (06) :1004-1012
[10]   EXACT MULTINOMIAL PROBABILITIES FOR ONE-WAY CONTINGENCY-TABLES [J].
DUNLAP, WP ;
MYERS, L ;
SILVER, NC .
BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1984, 16 (01) :54-56