Application of the bi-factor multidimensional item response theory model to testlet-based tests

被引:117
作者
DeMars, Christine E. [1 ]
机构
[1] James Madison Univ, Ctr Assessment & Res, Harrisonburg, VA 22807 USA
关键词
D O I
10.1111/j.1745-3984.2006.00010.x
中图分类号
G44 [教育心理学];
学科分类号
0402 ; 040202 ;
摘要
Four item response theory (IRT) models were compared using data from tests where multiple items were grouped into testlets focused on a common stimulus. In the bi-factor model each item was treated as a function of a primary trait plus a nuisance trait due to the testlet; in the testlet-effects model the slopes in the direction of the testlet traits were constrained within each testlet to be proportional to the slope in the direction of the primary trait; in the polytomous model the item scores were summed into a single score for each testlet; and in the independent-items model the testlet structure was ignored. Using the simulated data, reliability was overestimated somewhat by the independent-items model when the items were not independent within testlets. Under these nonindependent conditions, the independent-items model also yielded greater root mean square error (RMSE) for item difficulty and underestimated the item slopes. When the items within testlets were instead generated to be independent, the bi-factor model yielded somewhat higher RMSE in difficulty and slope. Similar differences between the models were illustrated with real data.
引用
收藏
页码:145 / 168
页数:24
相关论文
共 32 条
  • [1] ACKERMAN TA, 1987, ANN M AM ED RES ASS
  • [2] Adams R., 2002, PISA 2000 technical report
  • [3] [Anonymous], WINBUGS 1 4 COMPUTER
  • [4] BOCK RD, 2003, TESTFACT 4 0 COMPUTE
  • [5] A Bayesian random effects model for testlets
    Bradlow, ET
    Wainer, H
    Wang, XH
    [J]. PSYCHOMETRIKA, 1999, 64 (02) : 153 - 168
  • [6] Cook K F, 1999, J Outcome Meas, V3, P1
  • [7] DRESHER AR, 2004, ANN M NAT COUNC MEAS
  • [8] du Toit M., 2003, IRT SSI BILOG MG MUL
  • [9] FULL-INFORMATION ITEM BIFACTOR ANALYSIS
    GIBBONS, RD
    HEDEKER, DR
    [J]. PSYCHOMETRIKA, 1992, 57 (03) : 423 - 436
  • [10] Glas C. A., 2000, COMPUTERIZED ADAPTIV, P271, DOI DOI 10.1007/0-306-47531-6_14