Evaluating the magnitude of differential item functioning in polytomous items

被引:54
作者
Zwick, R [1 ]
Thayer, DT [1 ]
机构
[1] EDUC TESTING SERV, RES STAT GRP, PRINCETON, NJ 08541 USA
关键词
differential item functioning; polytomous items; standardization DIF statistic;
D O I
10.3102/10769986021003187
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Several recent studies have investigated the application of statistical inference procedures to the analysis of differential item functioning (DIF) in polytomous test items that are scored on an ordinal scale. Mantel's extension of the Mantel-Haenszel test is one of several hypothesis-testing methods for this purpose. The development of descriptive statistics for characterizing DIF in polytomous rest items has received less attention. As a step in this direction, two possible standard error formulas for the polytomous DIF index proposed by Dorans and Schmitt were derived. These standard errors, as well as associated hypothesis-testing procedures, were evaluated though application to simulated data. The standard error that performed better is based on Mantel's hypergeometric model. The alternative standard error, based on a multinomial model, tended to yield values thar were too small.
引用
收藏
页码:187 / 201
页数:15
相关论文
共 32 条
[1]  
Agresti A., 1990, Analysis of categorical data
[2]  
[Anonymous], J AM STAT ASSOC
[3]  
CHANG H, 1995, 955 ED TEST SERV
[4]   THE UNIQUE CORRESPONDENCE OF THE ITEM RESPONSE FUNCTION AND ITEM CATEGORY RESPONSE FUNCTIONS IN POLYTOMOUSLY SCORED ITEM RESPONSE MODELS [J].
CHANG, HH ;
MAZZEO, J .
PSYCHOMETRIKA, 1994, 59 (03) :391-404
[5]  
Donoghue J. R., 1993, Differential item functioning, P137
[6]  
DORANS NJ, 1993, CONSTRUCTION VERSUS CHOICE IN COGNITIVE MEASUREMENT : ISSUES IN CONSTRUCTED RESPONSE, PERFORMANCE TESTING, AND PORTFOLIO ASSESSMENT, P135
[7]   DEMONSTRATING THE UTILITY OF THE STANDARDIZATION APPROACH TO ASSESSING UNEXPECTED DIFFERENTIAL ITEM PERFORMANCE ON THE SCHOLASTIC APTITUDE-TEST [J].
DORANS, NJ ;
KULICK, E .
JOURNAL OF EDUCATIONAL MEASUREMENT, 1986, 23 (04) :355-368
[8]  
DORANS NJ, 1991, 9147 ETS
[9]  
GLAS CAW, 1991, 912 MEAS RES DEP
[10]  
Holland P. W., 1988, TEST VALIDITY, P129, DOI [DOI 10.1017/CBO9781107415324.004, 10.1002/j.2330-8516.1986.tb00186.x]