Empirical Bayes and item-clustering effects in a latent variable hierarchical model: A case study from the national assessment of educational progress

被引:36
作者
Scott, SL [1 ]
Ip, EH [1 ]
机构
[1] Univ So Calif, Marshall Sch Business, Los Angeles, CA 90089 USA
关键词
educational testing; Gibbs sampler; item response theory; Markov chain Monte Carlo; multinomial logistic regression; psychometrics;
D O I
10.1198/016214502760046961
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Empirical Bayes regression procedures are often used in educational and psychological testing as extensions to latent variables models. The National Assessment of Educational Progress (NAEP) is an important national survey using such procedures. The NAEP applies empirical Bayes methods to models from item response theory to calibrate student responses to questions of varying difficulty. Due partially to the limited computing technology that existed when NAEP was first conceived, NAEP analyses are carried out using a two-stage estimation procedure that ignores uncertainty about some model parameters. Furthermore, the item response theory model that NAEP uses ignores the effect of item clustering created by the design of a test form. Using Markov chain Monte Carlo, we simultaneously estimate all parameters of an expanded model that considers item clustering to investigate the impact of item clustering and ignoring uncertainty about model parameters on an important outcome measure that NAEP report. Ignoring these two effects causes substantial underestimation of standard errors and induces a modest bias in location estimates.
引用
收藏
页码:409 / 419
页数:11
相关论文
共 50 条
[1]   Multilevel item response models: An approach to errors in variables regression [J].
Adams, RJ ;
Wilson, M ;
Wu, M .
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 1997, 22 (01) :47-76
[2]   BAYESIAN-ESTIMATION OF NORMAL OGIVE ITEM RESPONSE CURVES USING GIBBS SAMPLING [J].
ALBERT, JH .
JOURNAL OF EDUCATIONAL STATISTICS, 1992, 17 (03) :251-269
[3]   BAYESIAN-ANALYSIS OF BINARY AND POLYCHOTOMOUS RESPONSE DATA [J].
ALBERT, JH ;
CHIB, S .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (422) :669-679
[4]  
ALLEN NL, 2000, 2001509 NCES US DEP
[5]   Graphical models and computerized adaptive testing [J].
Almond, RG ;
Mislevy, RJ .
APPLIED PSYCHOLOGICAL MEASUREMENT, 1999, 23 (03) :223-237
[6]  
[Anonymous], MODELS UNCERTAINTY E
[7]   Latent variable regression for multiple discrete outcomes [J].
Bandeen-Roche, K ;
Miglioretti, DL ;
Zeger, SL ;
Rathouz, PJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (440) :1375-1386
[8]  
Birnbaum A., 1968, STAT THEORIES MENTAL
[9]   A Bayesian random effects model for testlets [J].
Bradlow, ET ;
Wainer, H ;
Wang, XH .
PSYCHOMETRIKA, 1999, 64 (02) :153-168
[10]   A hierarchical latent variable model for ordinal data from a customer satisfaction survey with "no answer" responses [J].
Bradlow, ET ;
Zaslavsky, AM .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (445) :43-52