Is it necessary to make anchor tests mini-versions of the tests being equated or can some restrictions be relaxed?

被引:52
作者
Sinharay, Sandip [1 ]
Holland, Paul W. [1 ]
机构
[1] Educ Testing Serv, Princeton, NJ 08541 USA
关键词
D O I
10.1111/j.1745-3984.2007.00037.x
中图分类号
G44 [教育心理学];
学科分类号
0402 [心理学]; 040202 [发展与教育心理学];
摘要
It is a widely held belief that anchor tests should be miniature versions (i.e., minitests), with respect to content and statistical characteristics, of the tests being equated. This article examines the foundations for this belief regarding statistical characteristics. It examines the requirement of statistical representativeness of anchor tests that are content representative. The equating performance of several types of anchor tests, including those having statistical characteristics that differ from those of the tests being equated, is examined through several simulation studies and a real data example. Anchor tests with a spread of item difficulties less than that of a total test seem to perform as well as a minitest with respect to equating bias and equating standard error. Hence, the results demonstrate that requiring an anchor test to mimic the statistical characteristics of the total test may be too restrictive and need not be optimal. As a side benefit, this article also provides a comparison of the equating performance of post-stratification equating and chain equipercentile equating.
引用
收藏
页码:249 / 275
页数:27
相关论文
共 22 条
[1]
Angoff W. H., 1968, COLL BOARD REV, V68, P11
[2]
ANGOFF WH, 1971, ED MEASUREMENT
[3]
[Anonymous], TEST EQUATING
[5]
PROBLEMS RELATED TO THE USE OF CONVENTIONAL AND ITEM RESPONSE THEORY EQUATING METHODS IN LESS THAN OPTIMAL CIRCUMSTANCES [J].
COOK, LL ;
PETERSEN, NS .
APPLIED PSYCHOLOGICAL MEASUREMENT, 1987, 11 (03) :225-244
[6]
DAVEY T, 1997, 974 ACTT INC
[7]
DORANS NJ, 1998, GUIDELINES SELECTION
[8]
DORANS NJ, 1994, EQUATING ISSUES ENGE
[9]
FDA licences imatinib mesylate for CML [J].
Habeck, M .
LANCET ONCOLOGY, 2002, 3 (01) :6-6
[10]
Obtaining a common scale for item response theory item parameters using separate versus concurrent estimation in the common-item equating design [J].
Hanson, BA ;
Béguin, AA .
APPLIED PSYCHOLOGICAL MEASUREMENT, 2002, 26 (01) :3-24