Detecting differential gene expression with a semiparametric hierarchical mixture method

被引:400
作者
Newton, MA
Noueiry, A
Sarkar, D
Ahlquist, P
机构
[1] Univ Wisconsin, Dept Stat, Madison, WI 53706 USA
[2] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53792 USA
[3] Univ Wisconsin, Inst Mol Virol, Madison, WI 53706 USA
[4] Univ Wisconsin, Inst Mol Virol, Madison, WI 53706 USA
关键词
empirical Bayes; false discovery rate; microarrays; ordered alternatives; posterior probability; statistical analysis;
D O I
10.1093/biostatistics/5.2.155
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Mixture modeling provides an effective approach to the differential expression problem in microarray data analysis. Methods based on fully parametric mixture models are available, but lack of fit in some examples indicates that more flexible models may be beneficial. Existing, more flexible, mixture models work at the level of one-dimensional gene-specific summary statistics, and so when there are relatively few measurements per gene these methods may not provide sensitive detectors of differential expression. We propose a hierarchical mixture model to provide methodology that is both sensitive in detecting differential expression and sufficiently flexible to account for the complex variability of normalized microarray data. EM-based algorithms are used to fit both parametric and semiparametric versions of the model. We restrict attention to the two-sample comparison problem; an experiment involving Affymetrix microarrays and yeast translation provides the motivating case study. Gene-specific posterior probabilities of differential expression form the basis of statistical inference; they define short gene lists and false discovery rates. Compared to several competing methodologies, the proposed methodology exhibits good operating characteristics in a simulation study, on the analysis of spike-in data, and in a cross-validation calculation.
引用
收藏
页码:155 / 176
页数:22
相关论文
共 35 条
  • [1] A mixture model approach for the analysis of microarray gene expression data
    Allison, DB
    Gadbury, GL
    Heo, MS
    Fernández, JR
    Lee, CK
    Prolla, TA
    Weindruch, R
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 39 (01) : 1 - 20
  • [2] ANTONELLIS KJ, 2002, OPTIMIZATION EXTERNA
  • [3] Identifying differentially expressed genes in cDNA microarray experiments
    Baggerly, KA
    Coombes, KR
    Hess, KR
    Stivers, DN
    Abruzzo, LV
    Zhang, W
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (06) : 639 - 659
  • [4] A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes
    Baldi, P
    Long, AD
    [J]. BIOINFORMATICS, 2001, 17 (06) : 509 - 519
  • [5] BERGER J. O., 2013, Statistical Decision Theory and Bayesian Analysis, DOI [10.1007/978-1-4757-4286-2, DOI 10.1007/978-1-4757-4286-2]
  • [6] Bayesian hierarchical model for identifying changes in gene expression from microarray experiments
    Broët, P
    Richardson, S
    Radvanyi, F
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (04) : 671 - 683
  • [7] Chen Y, 1997, J Biomed Opt, V2, P364, DOI 10.1117/12.281504
  • [8] Multiple hypothesis testing in microarray experiments
    Dudoit, S
    Shaffer, JP
    Boldrick, JC
    [J]. STATISTICAL SCIENCE, 2003, 18 (01) : 71 - 103
  • [9] Dudoit S, 2002, STAT SINICA, V12, P111
  • [10] Empirical Bayes analysis of a microarray experiment
    Efron, B
    Tibshirani, R
    Storey, JD
    Tusher, V
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) : 1151 - 1160