Improving the Quality of Biomarker Discovery Research: The Right Samples and Enough of Them

被引:49
作者
Pepe, Margaret S. [1 ]
Li, Christopher I. [2 ]
Feng, Ziding [3 ]
机构
[1] Fred Hutchinson Canc Res Ctr, Biostat & Biomathemat Program, Div Publ Hlth Sci, Seattle, WA 98105 USA
[2] Fred Hutchinson Canc Res Ctr, Translat Res Program, Div Publ Hlth Sci, Seattle, WA 98105 USA
[3] Univ Texas MD Anderson Canc Ctr, Div Quantitat Sci, Dept Biostat, Houston, TX 77030 USA
关键词
MICROARRAY EXPERIMENTS; SIZE DETERMINATION; CLASSIFICATION; DESIGN;
D O I
10.1158/1055-9965.EPI-14-1227
中图分类号
R73 [肿瘤学];
学科分类号
100214 [肿瘤学];
摘要
Background: Biomarker discovery research has yielded few biomarkers that validate for clinical use. A contributing factor may be poor study designs. Methods: The goal in discovery research is to identify a subset of potentially useful markers from a large set of candidates assayed on case and control samples. We recommend the PRoBE design for selecting samples. We propose sample size calculations that require specifying: (i) a definition for biomarker performance; (ii) the proportion of useful markers the study should identify (Discovery Power); and (iii) the tolerable number of useless markers amongst those identified (False Leads Expected, FLE). Results: We apply the methodology to a study of 9,000 candidate biomarkers for risk of colon cancer recurrence where a useful biomarker has positive predictive value >= 30%. We find that 40 patients with recurrence and 160 without recurrence suffice to filter out 98% of useless markers (2% FLE) while identifying 95% of useful biomarkers (95% Discovery Power). Alternative methods for sample size calculation required more assumptions. Conclusions: Biomarker discovery research should utilize quality biospecimen repositories and include sample sizes that enable markers meeting prespecified performance characteristics for well-defined clinical applications to be identified. Impact: The scientific rigor of discovery research should be improved. (C) 2015 AACR.
引用
收藏
页码:944 / 950
页数:7
相关论文
共 16 条
[1]
[Anonymous], 2003, The statistical evaluation of medical tests for classification and prediction
[2]
Baker SG, 2009, JNCI-J NATL CANCER I, V101, P1
[3]
Using microarrays to study the microenvironment in tumor biology: The crucial role of statistics [J].
Baker, Stuart G. ;
Kramer, Barnett S. .
SEMINARS IN CANCER BIOLOGY, 2008, 18 (05) :305-310
[4]
Sample size determination in microarray experiments for class comparison and prognostic classification [J].
Dobbin, K ;
Simon, R .
BIOSTATISTICS, 2005, 6 (01) :27-38
[5]
Sample size planning for developing classifiers using high-dimensional DNA microarray data [J].
Dobbin, Kevin K. ;
Simon, Richard M. .
BIOSTATISTICS, 2007, 8 (01) :101-117
[6]
Research issues and strategies for genomic and proteomic biornarker discovery and validation: a statistical perspective [J].
Feng, Z ;
Prentice, R ;
Srivastava, S .
PHARMACOGENOMICS, 2004, 5 (06) :709-719
[7]
Classification versus association models: Should the same methods apply? [J].
Feng, Ziding .
SCANDINAVIAN JOURNAL OF CLINICAL & LABORATORY INVESTIGATION, 2010, 70 :53-58
[8]
Translation of proteomic biomarkers into FDA approved cancer diagnostics: issues and challenges [J].
Fuezery, Anna K. ;
Levin, Joshua ;
Chan, Maria M. ;
Chan, Daniel W. .
CLINICAL PROTEOMICS, 2013, 10
[9]
FDR-controlling testing procedures and sample size determination for microarrays [J].
Li, SYS ;
Bigler, J ;
Lampe, JW ;
Potter, JD ;
Feng, ZD .
STATISTICS IN MEDICINE, 2005, 24 (15) :2267-2280
[10]
Pivotal Evaluation of the Accuracy of a Biomarker Used for Classification or Prediction: Standards for Study Design [J].
Pepe, Margaret S. ;
Feng, Ziding ;
Janes, Holly ;
Bossuyt, Patrick M. ;
Potter, John D. .
JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2008, 100 (20) :1432-1438