Normal uniform mixture differential gene expression detection for cDNA microarrays

被引:44
作者
Dean, N [1 ]
Raftery, AE [1 ]
机构
[1] Univ Washington, Dept Stat, Seattle, WA 98195 USA
关键词
D O I
10.1186/1471-2105-6-173
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: One of the primary tasks in analysing gene expression data is finding genes that are differentially expressed in different samples. Multiple testing issues due to the thousands of tests run make some of the more popular methods for doing this problematic. Results: We propose a simple method, Normal Uniform Differential Gene Expression ( NUDGE) detection for finding differentially expressed genes in cDNA microarrays. The method uses a simple univariate normal-uniform mixture model, in combination with new normalization methods for spread as well as mean that extend the lowess normalization of Dudoit, Yang, Callow and Speed ( 2002) [ 1]. It takes account of multiple testing, and gives probabilities of differential expression as part of its output. It can be applied to either single-slide or replicated experiments, and it is very fast. Three datasets are analyzed using NUDGE, and the results are compared to those given by other popular methods: unadjusted and Bonferroni-adjusted t tests, Significance Analysis of Microarrays (SAM), and Empirical Bayes for microarrays (EBarrays) with both Gamma-Gamma and Lognormal-Normal models. Conclusion: The method gives a high probability of differential expression to genes known/ suspected a priori to be differentially expressed and a low probability to the others. In terms of known false positives and false negatives, the method outperforms all multiple-replicate methods except for the Gamma-Gamma EBarrays method to which it offers comparable results with the added advantages of greater simplicity, speed, fewer assumptions and applicability to the single replicate case. An R package called nudge to implement the methods in this paper will be made available soon at http://www.bioconductor.org.
引用
收藏
页数:14
相关论文
共 21 条
[1]   Global gene expression profiling in Escherichia coli K12 -: The effects of integration host factor [J].
Arfin, SM ;
Long, AD ;
Ito, ET ;
Tolleri, L ;
Riehle, MM ;
Paegle, ES ;
Hatfield, GW .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2000, 275 (38) :29672-29684
[2]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[3]   Bayesian hierarchical model for identifying changes in gene expression from microarray experiments [J].
Broët, P ;
Richardson, S ;
Radvanyi, F .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (04) :671-683
[4]  
Chen Y, 1997, J Biomed Opt, V2, P364, DOI 10.1117/12.281504
[5]   ROBUST LOCALLY WEIGHTED REGRESSION AND SMOOTHING SCATTERPLOTS [J].
CLEVELAND, WS .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (368) :829-836
[6]   LOCALLY WEIGHTED REGRESSION - AN APPROACH TO REGRESSION-ANALYSIS BY LOCAL FITTING [J].
CLEVELAND, WS ;
DEVLIN, SJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1988, 83 (403) :596-610
[7]   A simple procedure for the selection of significant effects [J].
Cox, DR .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2004, 66 :395-400
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]   Multiple hypothesis testing in microarray experiments [J].
Dudoit, S ;
Shaffer, JP ;
Boldrick, JC .
STATISTICAL SCIENCE, 2003, 18 (01) :71-103
[10]  
Dudoit S, 2002, STAT SINICA, V12, P111