Parametric versus non-parametric statistics in the analysis of randomized trials with non-normally distributed data

被引:272
作者
Vickers A.J. [1 ]
机构
[1] Integrative Medicine Service, Biostatistics Service, Memorial Sloan Kettering Cancer Center, New York, NY 10021, Howard 1312a
关键词
Relative Power; Change Score; Relative Efficiency; Parametric Method; Baseline Score;
D O I
10.1186/1471-2288-5-35
中图分类号
学科分类号
摘要
Background: It has generally been argued that parametric statistics should not be applied to data with non-normal distributions. Empirical research has demonstrated that Mann-Whitney generally has greater power than the t-test unless data are sampled from the normal. In the case of randomized trials, we are typically interested in how an endpoint, such as blood pressure or pain, changes following treatment. Such trials should be analyzed using ANCOVA, rather than t-test. The objectives of this study were: a) to compare the relative power of Mann-Whitney and ANCOVA; b) to determine whether ANCOVA provides an unbiased estimate for the difference between groups; c) to investigate the distribution of change scores between repeat assessments of a nonnormally distributed variable. Methods: Polynomials were developed to simulate five archetypal non-normal distributions for baseline and post-treatment scores in a randomized trial. Simulation studies compared the power of Mann-Whitney and ANCOVA for analyzing each distribution, varying sample size, correlation and type of treatment effect (ratio or shift). Results: Change between skewed baseline and post-treatment data tended towards a normal distribution. ANCOVA was generally superior to Mann-Whitney in most situations, especially where log-transformed data were entered into the model. The estimate of the treatment effect from ANCOVA was not importantly biased. Conclusion: ANCOVA is the preferred method of analyzing randomized trials with baseline and post-treatment measures. In certain extreme cases, ANCOVA is less powerful than Mann-Whitney. Notably, in these cases, the estimate of treatment effect provided by ANCOVA is of questionable interpretability. © 2005 Vickers; licensee BioMed Central Ltd.
引用
收藏
相关论文
共 20 条
[1]  
Altman D.G., Practical Statistics for Medical Research, (1991)
[2]  
Jekel J.F., Katz D.L., Elmore J.G., Epidemiology, Biostatistics and Preventive Medicine, (2001)
[3]  
Heeren T., D'Agostino R., Robustness of the two independent samples t-test when applied to ordinal scaled data, Stat Med, 6, pp. 79-90, (1987)
[4]  
Sawilowsky S.S., Comments on using alternative to normal theory statistics in social and behavioural science, Canadian Psychology, 34, pp. 432-439, (1993)
[5]  
Zimmerman D.W., Zumbo B.D., The effect of outliers on the relative power of parametric and nonparametric statistical tests, Perceptual and Motor Skills, 71, pp. 339-349, (1990)
[6]  
Sawilowsky S.S., Clifford-Blair R., A more realistic look at the robustness and Type II error properties of the t test to departures from population normality, Psychological Bulletin, 111, pp. 352-360, (1992)
[7]  
Bridge P.D., Sawilowsky S.S., Increasing physicians' awareness of the impact of statistics on research outcomes: Comparative power of the t-test and and Wilcoxon Rank-Sum test in small samples applied research, J Clin Epidemiol, 52, pp. 229-235, (1999)
[8]  
Micceri T., The unicorn, the normal curve, and other improbable creatures, Psychological Bulletin, 105, pp. 156-166, (1989)
[9]  
Senn S., Statistical Issues in Drug Development, (1997)
[10]  
Vickers A.J., The use of percentage change from baseline as an outcome in a controlled trial is statistically inefficient: A simulation study, BMC Med Res Methodol, 1, (2001)