Empirical Bayes and item-clustering effects in a latent variable hierarchical model: A case study from the national assessment of educational progress

Cited by: 36
Authors
Scott, SL [1]
Ip, EH [1]
Affiliations
[1] Univ So Calif, Marshall Sch Business, Los Angeles, CA 90089 USA
Keywords
educational testing; Gibbs sampler; item response theory; Markov chain Monte Carlo; multinomial logistic regression; psychometrics
DOI
10.1198/016214502760046961
Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
Empirical Bayes regression procedures are often used in educational and psychological testing as extensions to latent variables models. The National Assessment of Educational Progress (NAEP) is an important national survey using such procedures. The NAEP applies empirical Bayes methods to models from item response theory to calibrate student responses to questions of varying difficulty. Due partially to the limited computing technology that existed when NAEP was first conceived, NAEP analyses are carried out using a two-stage estimation procedure that ignores uncertainty about some model parameters. Furthermore, the item response theory model that NAEP uses ignores the effect of item clustering created by the design of a test form. Using Markov chain Monte Carlo, we simultaneously estimate all parameters of an expanded model that considers item clustering to investigate the impact of item clustering and ignoring uncertainty about model parameters on an important outcome measure that NAEP reports. Ignoring these two effects causes substantial underestimation of standard errors and induces a modest bias in location estimates.
Pages: 409-419
Page count: 11