On Bayesian analysis of multirater ordinal data: An application to automated essay grading

被引:51
作者
Johnson, VE
机构
关键词
Bayesian inference; categorical data; Gibbs sampling; hierarchical models; latent structure models; Markov chain Monte Carlo;
D O I
10.2307/2291381
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A framework is proposed for the analysis of ordinal categorical data when ratings from several judges are available. I emphasize the tasks of estimating latent trait characteristics of individual items, regressing these latent traits on observed covariates, and comparing the performance of raters. The model is illustrated in the design and evaluation of an automated essay grader. This grader is based on a regression of variables, obtained from a grammar checker, on essay scores estimated from a panel of experts. The performance of the grader is evaluated relative to human graders, and implications on the reliability and repeatability of both automated and human raters is investigated.
引用
收藏
页码:42 / 51
页数:10
相关论文
共 11 条
[1]   BAYESIAN-ESTIMATION OF NORMAL OGIVE ITEM RESPONSE CURVES USING GIBBS SAMPLING [J].
ALBERT, JH .
JOURNAL OF EDUCATIONAL STATISTICS, 1992, 17 (03) :251-269
[2]   BAYESIAN-ANALYSIS OF BINARY AND POLYCHOTOMOUS RESPONSE DATA [J].
ALBERT, JH ;
CHIB, S .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (422) :669-679
[3]   MARGINAL MAXIMUM-LIKELIHOOD ESTIMATION OF ITEM PARAMETERS - APPLICATION OF AN EM ALGORITHM [J].
BOCK, RD ;
AITKIN, M .
PSYCHOMETRIKA, 1981, 46 (04) :443-459
[4]   SAMPLING-BASED APPROACHES TO CALCULATING MARGINAL DENSITIES [J].
GELFAND, AE ;
SMITH, AFM .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1990, 85 (410) :398-409
[5]  
MCCULLAGH P, 1980, J ROY STAT SOC B MET, V42, P109
[7]   THE CALCULATION OF POSTERIOR DISTRIBUTIONS BY DATA AUGMENTATION [J].
TANNER, MA ;
WING, HW .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1987, 82 (398) :528-540
[8]   MODELING AGREEMENT AMONG RATERS [J].
TANNER, MA ;
YOUNG, MA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1985, 80 (389) :175-180
[9]   MODELING APPROACHES FOR THE ANALYSIS OF OBSERVER AGREEMENT [J].
UEBERSAX, JS .
INVESTIGATIVE RADIOLOGY, 1992, 27 (09) :738-743
[10]   A LATENT TRAIT FINITE MIXTURE MODEL FOR THE ANALYSIS OF RATING AGREEMENT [J].
UEBERSAX, JS ;
GROVE, WM .
BIOMETRICS, 1993, 49 (03) :823-835