The prevalence of negative studies with inadequate statistical power: An analysis of the plastic surgery literature

Cited by: 52
Authors
Chung, KC
Kalliainen, LK
Spilson, SV
Walters, MR
Kim, HM
Affiliations
[1] Univ Michigan, Med Ctr, Sect Plast Surg, Hand Ctr,Dept Surg, Ann Arbor, MI 48109 USA
[2] St Joseph Mercy Hosp, Ann Arbor, MI 48104 USA
Keywords
DOI
10.1097/00006534-200201000-00001
Chinese Library Classification (CLC) number
R61 [Operative Surgery];
Discipline classification code
Abstract
Studies published in the medical literature often neglect to consider the statistical power needed to detect a meaningful difference between study groups. Small sample sizes tend to produce negative results because of low statistical power. Studies that cannot make conclusive statements about their hypotheses can waste resources, deter further research, and impede advances in clinical treatment. The current study reviewed three of the most frequently read plastic surgery journals from 1976 to 1996 to determine the prevalence of inadequately (<80 percent) powered clinical trials and experimental studies that found no difference (negative studies) in the response variable of interest between comparison groups. The statistical power of 54 negative studies using continuous response variables was calculated to detect a difference of 1 SD (+/-1 SD) in means between the comparison groups. The power of another 57 negative studies with dichotomous response (yes/no) variables was calculated to detect a relative change in proportions of 25 percent and 50 percent from the experimental to the control group. It was found that 85 percent of the studies with continuous response variables had inadequate power to detect the desired mean difference of 1 SD. In studies with dichotomous response variables, 98 percent had inadequate power to detect a desired 25 percent relative change in proportions, and 74 percent had inadequate power to detect a desired 50 percent relative change in proportions. These results indicate that many of the studies in the plastic surgery literature lack adequate power to detect a moderate-to-large difference between groups. The lack of power makes the interpretation of the studies with negative findings inconclusive. Proper study design dictates that investigators consider a priori the difference between groups that is of clinical interest, and the sample size per group that is needed to provide adequate statistical power to detect the desired difference.
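The two kinds of power calculation described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not say which formulas or software were used. The sketch assumes a two-sided test at alpha = 0.05, equal group sizes, and the usual large-sample normal approximation. The function names `power_means` and `power_proportions`, the control-group proportion, and the sample sizes are all hypothetical values chosen for illustration.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal distribution (mean 0, SD 1)


def power_means(effect_size_sd, n_per_group, alpha=0.05):
    """Approximate power to detect a difference in means.

    effect_size_sd is the difference expressed in SD units (1.0 = 1 SD).
    Assumption: normal approximation, two-sided test, equal group sizes.
    The negligible contribution of the opposite tail is ignored.
    """
    z_crit = _STD_NORMAL.inv_cdf(1 - alpha / 2)
    return _STD_NORMAL.cdf(effect_size_sd * sqrt(n_per_group / 2) - z_crit)


def power_proportions(p_control, relative_change, n_per_group, alpha=0.05):
    """Approximate power to detect a relative change in a proportion.

    relative_change=0.25 means the experimental proportion is 25 percent
    lower than the control proportion (direction chosen for illustration).
    Assumption: normal approximation, two-sided test, equal group sizes.
    """
    p_exp = p_control * (1 - relative_change)
    p_bar = (p_control + p_exp) / 2
    z_crit = _STD_NORMAL.inv_cdf(1 - alpha / 2)
    se_null = sqrt(2 * p_bar * (1 - p_bar))  # per-unit SE under no difference
    se_alt = sqrt(p_control * (1 - p_control) + p_exp * (1 - p_exp))
    z = (abs(p_control - p_exp) * sqrt(n_per_group) - z_crit * se_null) / se_alt
    return _STD_NORMAL.cdf(z)


if __name__ == "__main__":
    # Hypothetical small study: 10 subjects per group, 1 SD difference.
    print(f"means, n=10 per group: {power_means(1.0, 10):.2f}")
    # Hypothetical control proportion of 0.40 with 30 subjects per group.
    for change in (0.25, 0.50):
        power = power_proportions(0.40, change, 30)
        print(f"proportions, {change:.0%} relative change: {power:.2f}")
```

Under these assumptions, power rises with the sample size per group and with the size of the difference to be detected, which is why small negative studies so often fall below the 80 percent threshold used in the abstract.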
Pages: 1-6
Number of pages: 6
Related papers
17 in total
[1]   THE SCANDAL OF POOR MEDICAL-RESEARCH [J].
ALTMAN, DG .
BRITISH MEDICAL JOURNAL, 1994, 308 (6924) :283-284
[2]   Type II (β) errors in the hand literature: The importance of power [J].
Chung, KC ;
Kalliainen, LK ;
Hayward, RA .
JOURNAL OF HAND SURGERY-AMERICAN VOLUME, 1998, 23A (01) :20-25
[3]  
CLEGG F, 1988, BRIT J HOSP MED, V40, P396
[4]  
COHEN J, 1977, STAT POWER ANAL BEHA, P5
[5]   EVALUATING THE MEDICAL LITERATURE .2. STATISTICAL-ANALYSIS [J].
ELENBAAS, RM ;
ELENBAAS, JK ;
CUDDY, PG .
ANNALS OF EMERGENCY MEDICINE, 1983, 12 (10) :610-620
[6]  
Fleiss JL., 1981, STAT METHODS RATES P, V2nd, P33
[7]   ABSOLUTELY RELATIVE - HOW RESEARCH RESULTS ARE SUMMARIZED CAN AFFECT TREATMENT DECISIONS [J].
FORROW, L ;
TAYLOR, WC ;
ARNOLD, RM .
AMERICAN JOURNAL OF MEDICINE, 1992, 92 (02) :121-124
[8]   IMPORTANCE OF BETA, TYPE-II ERROR AND SAMPLE-SIZE IN DESIGN AND INTERPRETATION OF RANDOMIZED CONTROL TRIAL - SURVEY OF 71 NEGATIVE TRIALS [J].
FREIMAN, JA ;
CHALMERS, TC ;
SMITH, H ;
KUEBLER, RR .
NEW ENGLAND JOURNAL OF MEDICINE, 1978, 299 (13) :690-694
[9]  
HELBERG C, 1995, PITFALLS DATA ANAL A
[10]  
HENNEKENS CH, 1987, EPIDEMIOLOGY MED, P182