An empirical bayes adjustment to increase the sensitivity of detecting differentially expressed genes in microarray experiments

被引:23
作者
Datta, S [1 ]
Satten, GA
Xia, JZ
Heslin, MJ
Datta, S [1 ]
机构
[1] Univ Georgia, Dept Stat, Athens, GA 30602 USA
[2] Georgia State Univ, Dept Math & Stat, Atlanta, GA 30303 USA
[3] Ctr Dis Control & Prevent, Atlanta, GA 30333 USA
[4] Univ Alabama Birmingham, Dept Physiol & Biophys, Birmingham, AL 35294 USA
[5] Univ Alabama Birmingham, Dept Surg, Birmingham, AL 35294 USA
关键词
D O I
10.1093/bioinformatics/btg396
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Detection of differentially expressed genes is one of the major goals of microarray experiments. Pairwise comparison for each gene is not appropriate without controlling the overall (experimentwise) type 1 error rate. Dudoit et al. have advocated use of permutation-based step-down P-value adjustments to correct the observed significance levels for the individual (i.e. for each gene) two sample t-tests. Results: In this paper, we consider an ANOVA formulation of the gene expression levels corresponding to multiple tissue types. We provide resampling-based step-down adjustments to correct the observed significance levels for the individual ANOVA t-tests for each gene and for each pair of tissue type comparisons. More importantly, we introduce a novel empirical Bayes adjustment to the t-test statistics that can be incorporated into the step-down procedure. Using simulated data, we show that the empirical Bayes adjustment improved the sensitivity of detecting differentially expressed genes up to 16%, while maintaining a high level of specificity. This adjustment also reduces the false non-discovery rate to some degree at the cost of a modest increase in the false discovery rate. We illustrate our approach using a human colon cancer dataset consisting of oligonucleotide arrays of normal, adenoma and carcinoma cells. The number of genes with differential expression level declared statistically significant was about 50 when comparing normal to adenoma cells and about five when comparing adenoma to carcinoma cells. This list includes genes previously known to be associated with colon cancer as well as some novel genes.
引用
收藏
页码:235 / 242
页数:8
相关论文
共 20 条
[1]  
[Anonymous], 1993, Resampling-based multiple testing: Examples and methods for P-value adjustment
[2]  
[Anonymous], STAT SIGNIFICANCE GE
[3]  
Cohen MB, 1998, LAB INVEST, V78, P101
[4]  
Dudoit S, 2002, STAT SINICA, V12, P111
[5]   DATA-ANALYSIS USING STEINS ESTIMATOR AND ITS GENERALIZATIONS [J].
EFRON, B ;
MORRIS, C .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1975, 70 (350) :311-319
[6]   Empirical Bayes analysis of a microarray experiment [J].
Efron, B ;
Tibshirani, R ;
Storey, JD ;
Tusher, V .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1151-1160
[7]   Empirical Bayes methods and false discovery rates for microarrays [J].
Efron, B ;
Tibshirani, R .
GENETIC EPIDEMIOLOGY, 2002, 23 (01) :70-86
[8]  
GIBBONS RD, 2000, IN PRESS JSPI
[9]   Bayesian models for gene expression with DNA microarray data [J].
Ibrahim, JG ;
Chen, MH ;
Gray, RJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) :88-99
[10]   Exploration, normalization, and summaries of high density oligonucleotide array probe level data [J].
Irizarry, RA ;
Hobbs, B ;
Collin, F ;
Beazer-Barclay, YD ;
Antonellis, KJ ;
Scherf, U ;
Speed, TP .
BIOSTATISTICS, 2003, 4 (02) :249-264