Real-time feedback on rater drift in constructed-response items: An example from the golden state examination

被引:30
作者
Hoskens, M [1 ]
Wilson, M [1 ]
机构
[1] Univ Calif Berkeley, Sch Educ, Berkeley, CA 94720 USA
关键词
D O I
10.1111/j.1745-3984.2001.tb01119.x
中图分类号
G44 [教育心理学];
学科分类号
0402 ; 040202 ;
摘要
In this study, patterns of variation in severities of a group of raters over time or so-called "rater drift" was examined when raters scored an essay written under examination conditions. At the same time feedback was given to rater leaders (called "table leaders") who then interpreted the feedback and reported to the raters. Rater severities in five successive periods were estimated using a modified linear logistic test model (LLTM, Fischer 1973) approach. It was found that the raters did indeed drift towards the mean, but a planned comparision of the feedback with a control condition was not successful; it was believed that this was due to contamination at the table leader level. A series of models was also estimated designed to detect other types of rater effects beyond severity: a tendency to use extreme scores, and tendency to prefer certain categories. The models for these effects were found to be showing significant improvement in fit, implying that these effects were indeed present, although they were difficult to detect in relatively short time periods.
引用
收藏
页码:121 / 145
页数:25
相关论文
共 30 条
[1]  
ANDRICH A, 1985, SOCIOL METHODOL, P33
[2]  
[Anonymous], 2000, OBJECTIVE MEASUREMEN
[3]  
[Anonymous], MODELS UNCERTAINTY E
[4]  
[Anonymous], 1997, PSYCHOL TESTING PRIN
[5]  
ASHBURN R, 1963, J EXPT ED, V7, P1
[6]  
BRAUN HI, 1988, J ED STAT, V13
[7]  
BRYK AS, 1996, HLM VERSION 4
[8]  
CRAVEN RG, 1999, UNPUB DIFFUSION EFFE
[9]   Reliability of repeated grading of essay type examinations [J].
Eells, WC .
JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1930, 21 :48-52