Data splitting as a countermeasure against hypothesis fishing: with a case study of predictors for low back pain

Cited by: 35

Authors:

Dahl, Fredrik A. [1]

Grotle, Margreth [2,3]

Benth, Jurate Saltyte [4]

Natvig, Bard [5]

Affiliations:

[1] Aker Univ Hosp, Helse Sor Ost Hlth Serv Res Ctr, N-1474 Lorenskog, Norway

[2] Diakonhjemmet Hosp, Dept Rheumatol, Natl Resource Ctr Rehabilitat Rheumatol, N-0319 Oslo, Norway

[3] Ullevaal Univ Hosp, Div Neurosci & Musculoskeletal Med, FORMI Sect, N-0407 Oslo, Norway

[4] Univ Oslo, Helse Sor Ost Hlth Serv Res Ctr, N-1474 Lorenskog, Norway

[5] Univ Oslo, Fac Med, Inst Gen Practice & Community Hlth, Sect Occupat Hlth & Social Insurance Med, N-0318 Oslo, Norway

Source:

EUROPEAN JOURNAL OF EPIDEMIOLOGY | 2008 / Vol. 23 / Issue 04

Keywords:

data splitting; hypothesis fishing; data dredging; two-stage analysis; low back pain

DOI:

10.1007/s10654-008-9230-x

Chinese Library Classification (CLC) number:

R1 [Preventive medicine, hygiene]

Discipline classification codes:

1004; 120402

Abstract:

There is growing concern in the scientific community that many published scientific findings may represent spurious patterns that are not reproducible in independent data sets. A reason for this is that significance levels or confidence intervals are often applied to secondary variables or sub-samples within the trial, in addition to the primary hypotheses (multiple hypotheses). This problem is likely to be extensive for population-based surveys, in which epidemiological hypotheses are derived after seeing the data set (hypothesis fishing). We recommend a data-splitting procedure to counteract this methodological problem, in which one part of the data set is used for identifying hypotheses, and the other is used for hypothesis testing. The procedure is similar to two-stage analysis of microarray data. We illustrate the process using a real data set related to predictors of low back pain at 14-year follow-up in a population initially free of low back pain. "Widespreadness" of pain (pain reported in several other places than the low back) was a statistically significant predictor, while smoking was not, despite its strong association with low back pain in the first half of the data set. We argue that the application of data splitting, in which an independent party handles the data set, will achieve for epidemiological surveys what pre-registration has done for clinical studies.
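The data-splitting idea described in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' actual analysis: the data are simulated, the names `two_proportion_p_value`, `split_and_test`, `predictors`, and `outcome` are hypothetical, the association test (a two-proportion z-test with a normal approximation) is chosen here for simplicity, and the Bonferroni correction in the confirmation step is an added assumption for the case where more than one hypothesis is carried forward.

```python
import math
import random


def two_proportion_p_value(exposed, outcome):
    """Two-sided p-value of a two-proportion z-test (normal approximation)."""
    n1 = sum(exposed)
    n0 = len(exposed) - n1
    if n1 == 0 or n0 == 0:
        return 1.0  # no comparison possible
    x1 = sum(o for e, o in zip(exposed, outcome) if e)
    x0 = sum(o for e, o in zip(exposed, outcome) if not e)
    pooled = (x1 + x0) / (n1 + n0)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n0))
    if se == 0:
        return 1.0  # outcome is constant, no evidence of association
    z = (x1 / n1 - x0 / n0) / se
    return math.erfc(abs(z) / math.sqrt(2))


def split_and_test(predictors, outcome, n_candidates=1, seed=0):
    """Identify hypotheses on one random half, test them on the other half."""
    rng = random.Random(seed)
    rows = list(range(len(outcome)))
    rng.shuffle(rows)
    half = len(rows) // 2
    explore, confirm = rows[:half], rows[half:]

    def p_on(subset, name):
        return two_proportion_p_value(
            [predictors[name][i] for i in subset],
            [outcome[i] for i in subset],
        )

    # Stage 1 (hypothesis fishing is allowed here): rank predictors
    # by their p-value in the exploration half only.
    ranked = sorted(predictors, key=lambda name: p_on(explore, name))
    chosen = ranked[:n_candidates]

    # Stage 2: test only the chosen hypotheses on the untouched half,
    # with a Bonferroni correction for the number of hypotheses tested.
    return {name: min(1.0, p_on(confirm, name) * len(chosen)) for name in chosen}


# Simulated example: 20 binary predictors, none truly related to the outcome.
rng = random.Random(42)
n = 400
outcome = [int(rng.random() < 0.3) for _ in range(n)]
predictors = {
    f"x{k}": [int(rng.random() < 0.5) for _ in range(n)] for k in range(20)
}
confirmed = split_and_test(predictors, outcome, n_candidates=2, seed=1)
print(confirmed)
```

Because the confirmation half plays no part in choosing the hypotheses, its p-values are not inflated by the search over 20 candidates. In the setting the abstract proposes, the split would be held by an independent party rather than performed by the analyst.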


Pages: 237-242

Number of pages: 6
