Test item development: Validity evidence from quality assurance procedures

Cited by: 41
Authors
Downing, SM [1]
Haladyna, TM [1]
Affiliation
[1] ARIZONA STATE UNIV W,COLL EDUC,PHOENIX,AZ 85069
DOI
10.1207/s15324818ame1001_4
Chinese Library Classification number
G40 [Education];
Subject classification codes
040101 ; 120403 ;
Abstract
Validity of high-stakes examination scores involves collecting and organizing evidence to support a specific test score interpretation or use in some specific context. A primary type of validity evidence derives from the item development process and item responses. An ideal process is identified that documents how items are developed and how responses to the items are studied to ensure that the basic building blocks of tests, test items, are sound. This ideal process includes both qualitative and quantitative forms of evidence. A checklist is provided to help high-stakes examination developers assess the types and quality of item-level validity evidence required in validation.
Pages: 61-82
Page count: 22