Graphical models and computerized adaptive testing

Cited by: 45
Authors
Almond, RG [1]
Mislevy, RJ [1]
Affiliations
[1] Educ Testing Serv, Princeton, NJ 08541 USA
Keywords
adaptive testing; Bayesian nets; computerized adaptive testing; graphical models; item response theory;
DOI
10.1177/01466219922031347
Chinese Library Classification (CLC) number
O1 [Mathematics]; C [Social Sciences, General];
Subject classification code
03 ; 0303 ; 0701 ; 070101 ;
Abstract
Computerized adaptive testing (CAT) based on item response theory (IRT) is viewed from the perspective of graphical modeling (GM). GM provides methods for making inferences about multifaceted skills and knowledge, and for extracting data from complex performances. However, simply incorporating variables for all sources of variation is rarely successful. Thus, researchers must closely analyze the substance and structure of the problem to create more effective models. Researchers regularly employ sophisticated strategies to handle many sources of variability outside the IRT model. Relevant variables can play many roles without appearing in the operational IRT model per se, e.g., in validity studies, assembling tests, and constructing and modeling tasks. Some of these techniques are described from a GM perspective, as well as how to extend them to more complex assessment situations. Issues are illustrated in the context of language testing.
Pages: 223-237
Page count: 15
Related papers
38 items in total
[1]   The multidimensional random coefficients multinomial logit model [J].
Adams, RJ ;
Wilson, M ;
Wang, WC .
APPLIED PSYCHOLOGICAL MEASUREMENT, 1997, 21 (01) :1-23
[2]   BAYESIAN-ESTIMATION OF NORMAL OGIVE ITEM RESPONSE CURVES USING GIBBS SAMPLING [J].
ALBERT, JH .
JOURNAL OF EDUCATIONAL STATISTICS, 1992, 17 (03) :251-269
[3]  
Almond R.G., 1995, Graphical belief modeling
[4]  
[Anonymous], BUGS 0 5 EXAMPLES
[5]  
Bachman L. F., 1990, Fundamental Considerations in Language Testing
[6]  
Bachman Lyle., 2010, LANGUAGE ASSESSMENT
[7]   A GENERATIVE ANALYSIS OF A 3-DIMENSIONAL SPATIAL TASK [J].
BEJAR, II .
APPLIED PSYCHOLOGICAL MEASUREMENT, 1990, 14 (03) :237-245
[8]  
BERGER MPF, 1996, OBJECTIVE MEASUREMEN, V3
[9]  
BRADLOW ET, 1998, RR983 ED TEST SERV
[10]  
BREESE JS, 1994, IEEE T SYST MAN CYB, V24, P1577