Resampling-based multiple testing for microarray data analysis

被引:231
作者
Ge, YC
Dudoit, S
Speed, TP
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Div Biostat, Berkeley, CA 94720 USA
[3] Walter & Eliza Hall Inst Med Res, Div Genet & Bioinformat, Parkville, Vic, Australia
关键词
multiple testing; family-wise error rate; false discovery rate; adjusted p-value; fast algorithm; minP; microarray;
D O I
10.1007/BF02595811
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The burgeoning field of genomics has revived interest in multiple testing procedures by raising new methodological and computational challenges. For example, microarray experiments generate large multiplicity problems in which thousands of hypotheses are tested simultaneously. Westfall and Young (1993) propose resampling-based p-value adjustment procedures which are highly relevant to microarray experiments. This article discusses different criteria for error control in resampling-based multiple testing, including (a) the family wise error rate of Westfall and Young (1993) and (b) the false discovery rate developed by Benjamini and Hochberg (1995), both from a frequentist viewpoint; and (c) the positive false discovery rate of Storey (2002a), which has a Bayesian motivation. We also introduce our recently developed fast algorithm for implementing the minP adjustment to control family-wise error rate. Adjusted p-values for different approaches are applied to gene expression data from two recently published microarray studies. The properties of these procedures for multiple testing are compared.
引用
收藏
页码:1 / 77
页数:77
相关论文
共 62 条
  • [11] Microarray expression profiling identifies genes with altered expression in HDL-deficient mice
    Callow, MJ
    Dudoit, S
    Gong, EL
    Speed, TP
    Rubin, EM
    [J]. GENOME RESEARCH, 2000, 10 (12) : 2022 - 2029
  • [12] Exploring the metabolic and genetic control of gene expression on a genomic scale
    DeRisi, JL
    Iyer, VR
    Brown, PO
    [J]. SCIENCE, 1997, 278 (5338) : 680 - 686
  • [13] Dudoit S, 2002, STAT SINICA, V12, P111
  • [14] DUDOIT S, 2002, UNPUB MULTIPLE HYPOT
  • [15] ESTIMATION OF THE MEANS OF DEPENDENT-VARIABLES
    DUNN, OJ
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1958, 29 (04): : 1095 - 1111
  • [16] Empirical Bayes analysis of a microarray experiment
    Efron, B
    Tibshirani, R
    Storey, JD
    Tusher, V
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) : 1151 - 1160
  • [17] Empirical Bayes methods and false discovery rates for microarrays
    Efron, B
    Tibshirani, R
    [J]. GENETIC EPIDEMIOLOGY, 2002, 23 (01) : 70 - 86
  • [18] EFRON B, 2000, 37B213 STANF U DEP S
  • [19] Finner H, 2001, BIOMETRICAL J, V43, P985, DOI 10.1002/1521-4036(200112)43:8<985::AID-BIMJ985>3.0.CO
  • [20] 2-4