Sample size calculation with dependence a ustment for FDR-control in microarray studies

被引:24
作者
Shao, Yongzhao [1 ]
Tseng, Chi-Hong [1 ]
机构
[1] NYU, Sch Med, Div Biostat, New York, NY 10016 USA
关键词
dependence; false discovery rates; power; sample size; microarray data;
D O I
10.1002/sim.2862
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
DNA microarrays have been widely used for the purpose of simultaneously monitoring a large number of gene expression levels to identify differentially expressed genes. Statistical methods for the adjustment of multiple testing have been discussed extensively in the literature. An important further challenge is the existence of dependence among test statistics due to reasons such as gene co-regulation. To plan large-scale genomic studies, sample size determination with appropriate adjustment for both multiple testing and potential dependency among test statistics is crucial to avoid an abundance of false-positive results and/or serious lack of power. We introduce a general approach for calculating sample sizes for two-way multiple comparisons in the presence of dependence among test statistics to ensure adequate overall power when the false discovery rates are controlled. The usefulness of the proposed method is demonstrated via numerical studies using both simulated data and real data from a well-known study of leukaemia. Copyright (c) 2007 John Wiley & Sons, Ltd.
引用
收藏
页码:4219 / 4237
页数:19
相关论文
共 24 条
  • [1] AHN H, 1995, J COMPUTATIONAL GRAP, V4, P55
  • [2] Benjamini Y, 2001, ANN STAT, V29, P1165
  • [3] On the adaptive control of the false discovery fate in multiple testing with independent statistics
    Benjamini, Y
    Hochberg, Y
    [J]. JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2000, 25 (01) : 60 - 83
  • [4] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
  • [5] Billingsley P, 1968, CONVERGE PROBAB MEAS
  • [6] Empirical Bayes methods and false discovery rates for microarrays
    Efron, B
    Tibshirani, R
    [J]. GENETIC EPIDEMIOLOGY, 2002, 23 (01) : 70 - 86
  • [7] EFRON B, 2000, 37B213 STANF U DEP S
  • [8] Power and sample size estimation in high dimensional biology
    Gadbury, GL
    Page, GP
    Edwards, J
    Kayo, T
    Prolla, TA
    Weindruch, R
    Permana, PA
    Mountz, JD
    Allison, DB
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2004, 13 (04) : 325 - 338
  • [9] Resampling-based multiple testing for microarray data analysis
    Ge, YC
    Dudoit, S
    Speed, TP
    [J]. TEST, 2003, 12 (01) : 1 - 77
  • [10] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
    Golub, TR
    Slonim, DK
    Tamayo, P
    Huard, C
    Gaasenbeek, M
    Mesirov, JP
    Coller, H
    Loh, ML
    Downing, JR
    Caligiuri, MA
    Bloomfield, CD
    Lander, ES
    [J]. SCIENCE, 1999, 286 (5439) : 531 - 537