Controlling error in multiple comparisons, with examples from state-to-state differences in educational achievement

被引:114
作者
Williams, VSL
Jones, LV
Tukey, JW
机构
[1] Univ N Carolina, LL Thurstone Psychometr Lab, Dept Psychol, Chapel Hill, NC 27599 USA
[2] Princeton Univ, Princeton, NJ 08544 USA
关键词
educational assessment; hypothesis testing; multiple comparisons; National Assessment of Educational Progress (NAEP);
D O I
10.3102/10769986024001042
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Three alternative procedures to adjust significance levels for multiplicity are the traditional Bonferroni technique, a sequential Bonferroni technique developed by Hochberg (1988), and a sequential approach for controlling the false discovery rate proposed by Benjamini and Hochberg (1995). These procedures are illustrated and compared using examples from the National Assessment of Educational Progress (NAEP). A prominent advantage of the Benjamini and Hochberg (B-H) procedure, as demonstrated in these examples, is the greater invariance of statistical significance for given comparisons over alternative family sizes. Simulation studies show that all three procedures maintain a false discovery rate bounded above, often grossly, by alpha (or alpha/2). For both uncorrelated and pairwise families of comparisons, the B-H technique is shown to have greater power than the Hochberg or Bonferroni procedures, and its power remains relatively stable as the number of comparisons becomes large, giving it an increasing advantage when many comparisons are involved. We recommend that results from NAEP State Assessments be reported using the B-H technique rather than the Bonferroni procedure.
引用
收藏
页码:42 / 69
页数:28
相关论文
共 11 条