Subgroup analyses in randomized trials: risks of subgroup-specific analyses; power and sample size for the interaction test

被引：541

作者：

Brookes, ST

Whitely, E

Egger, M

Smith, GD

Mulheran, PA

Peters, TJ

机构：

[1] Univ Bristol, Dept Social Med, Bristol BS8 2PR, Avon, England

[2] Univ Bern, Dept Social & Prevent Med, CH-3012 Bern, Switzerland

[3] Univ Reading, Dept Phys, Reading RG6 2AF, Berks, England

[4] Univ Bristol, Div Primary Hlth Care, Bristol BS6 6JL, Avon, England

来源：

JOURNAL OF CLINICAL EPIDEMIOLOGY | 2004年 / 57卷 / 03期

关键词：

subgroup analyses; subgroup-specific tests; interaction test; sample size; power;

D O I：

10.1016/j.jclinepi.2003.08.009

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

Objective: Despite guidelines recommending the use of formal tests of interaction in subgroup analyses in clinical trials, inappropriate subgroup-specific analyses continue. Moreover, trials designed to detect overall treatment effects have limited power to detect treatment-subgroup interactions. This article quantifies the error rates associated with subgroup analyses. Study Design and Setting: Simulations quantified the risks of misinterpreting subgroup analyses as evidence of differential subgroup effects and the limited power of the interaction test in trials designed to detect overall treatment effects. Results: Although formal interaction tests performed as expected with respect to false positives, subgroup-specific tests were considerably less reliable: A significant effect in one subgroup only was observed in 7% to 64% of simulations depending on trial characteristics. Regarding power of the interaction test, a trial with 80% power for the overall effect had only 29% power to detect an interaction effect of the same magnitude. For interactions of this size to be detected with the same power as the overall effect, sample sizes should be inflated fourfold. increasing dramatically for interactions smaller than 20% of the overall effect. Conclusion: Although it is generally recognized that subgroup analyses can produce spurious results, the extent of the problem may be underestimated. (C) 2004 Elsevier Inc. All rights reserved.

引用

页码：229 / 236

页数：8

共 31 条

[1] [Anonymous], 1999, STAT MED, V18, P1905
[2] [Anonymous], 1992, Medical uses of statistics, DOI DOI 10.1201/9780429187445
[3] Subgroup analysis and other (mis)uses of baseline data in clinical trials
Assmann, SF
Pocock, SJ
Enos, LE
Kasten, LE
[J]. LANCET, 2000, 355 (9209) : 1064 - 1069
[4] BRESLOW N, 1980, STAT METH CANC RES, V1
[5] Brookes S T, 2001, Health Technol Assess, V5, P1
[6] BULPITT CJ, 1988, LANCET, V2, P31
[7] Clarke M, 1998, LANCET, V351, P1451
[8] Meta-analysis - Bias in location and selection of studies
Egger, M
Smith, GD
[J]. BMJ-BRITISH MEDICAL JOURNAL, 1998, 316 (7124): : 61 - 66
[9] García-Closas M, 1999, AM J EPIDEMIOL, V149, P689
[10] Power calculations for detecting interaction in stratified 2 x 2 tables
Gardiner, J
Pathak, D
Indurkhya, A
[J]. STATISTICS & PROBABILITY LETTERS, 1999, 41 (03) : 267 - 275

← 1 2 3 4 →