Multiple comparison procedures updated

被引:516
作者
Ludbrook, J [1 ]
机构
[1] Univ Melbourne, Royal Melbourne Hosp, Dept Surg, Parkville, Vic, Australia
关键词
Bonferroni; Dunnett; families of hypotheses; Hochberg; Holm; Shaffer; Sidak; Simes; simultaneous inference; step-wise procedures; Tukey-Kramer;
D O I
10.1111/j.1440-1681.1998.tb02179.x
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
1. A common statistical flaw in articles submitted to or published in biomedical research journals is to test multiple null hypotheses that originate from the results of a single experiment without correcting for the inflated risk of type 1 error (false positive statistical inference) that results from this. Multiple comparison procedures (MCP) are designed to minimize this risk. The present review focuses on pairwise contrasts, the most common sort of multiple comparisons made by biomedical investigators, 2. In an earlier review a variety of MCP were described and evaluated. It was concluded that an effective MCP should control the risk of family-wise type I error, so as to ensure that not more than one hypothesis within a single family is falsely rejected. One-step procedures based on the Bonferroni or Sidak inequalities do this. For continuous data and under normal distribution theory, so does the Tukey-Kramer procedure for all possible pairwise contrasts of means and the Dunnett procedure for all possible pairwise contrasts of means with a control mean. 3. There is now a new class of MCP, based on the Bonferroni or Sidak inequalities but performed in a step-wise fashion. The members of this class have certain desirable properties. They: (1) control the family-wise type I error rate as effectively as the one-step procedures; (ii) are mom powerful than the one-step Bonferroni or Sidak procedures, especially when hypotheses are logically related; and (iii) can be applied not only to continuous data but also to ordinal or categorical data. 4. Of the new step-wise MCP, Helm's step-down procedures are commended for their combination of accuracy, power and versatility. They also have the virtue of simplicity. Given the raw P values that result from conventional tests of significance, the adjustments for multiple comparisons can be made by hand or hand-held calculator. 5, Despite the corrective abilities of the new step-wise MCP, investigators should try to design their experiments and analyses to test a single, global hypothesis rather than multiple ones.
引用
收藏
页码:1032 / 1037
页数:6
相关论文
共 40 条
[1]  
[Anonymous], 1993, Resampling-based multiple testing: Examples and methods for P-value adjustment
[2]  
[Anonymous], 1993, MULTIPLE COMP PROCED
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]  
Brown BW, 1997, STAT MED, V16, P2511, DOI 10.1002/(SICI)1097-0258(19971130)16:22<2511::AID-SIM693>3.0.CO
[5]  
2-4
[6]   THE CORRELATION AND DEPENDENCE BETWEEN 2 F-STATISTICS WITH THE SAME DENOMINATOR [J].
FEINGOLD, M ;
KORSOG, PE .
AMERICAN STATISTICIAN, 1986, 40 (03) :218-220
[7]  
FREEMAN GH, 1951, BIOMETRIKA, V38, P141, DOI 10.1093/biomet/38.1-2.141
[8]  
GLASS GV, 1972, REV EDUC RES, V42, P237, DOI 10.3102/00346543042003237
[9]   COMPARING THE MEANS OF SEVERAL GROUPS [J].
GODFREY, K .
NEW ENGLAND JOURNAL OF MEDICINE, 1985, 313 (23) :1450-1456
[10]   ROBUSTNESS OF T TEST - A GUIDE FOR RESEARCHERS ON EFFECT OF VIOLATIONS OF ASSUMPTIONS [J].
HAVLICEK, LL ;
PETERSON, NL .
PSYCHOLOGICAL REPORTS, 1974, 34 (03) :1095-1114