On parametric empirical Bayes methods for comparing multiple groups using replicated gene expression profiles

被引:252
作者
Kendziorski, CM
Newton, MA
Lan, H
Gould, MN
机构
[1] Univ Wisconsin, Dept Biostat & Med Informat, Med Sci Ctr 6729, Madison, WI 53706 USA
[2] Univ Wisconsin, Dept Stat, Madison, WI 53706 USA
[3] Univ Wisconsin, McArdle Lab Canc Res, Madison, WI 53706 USA
关键词
hierarchical model; mixture model; microarray; differential expression; breast cancer;
D O I
10.1002/sim.1548
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
DNA microarrays provide for unprecedented large-scale views of gene expression and, as a result, have emerged as a fundamental measurement tool in the study of diverse biological systems. Statistical questions abound, but many traditional data analytic approaches do not apply, in large part because thousands of individual genes are measured with relatively little replication. Empirical Bayes methods provide a natural approach to microarray data analysis because they can significantly reduce the dimensionality of an inference problem while compensating for relatively few replicates by using information across the array. We propose a general empirical Bayes modelling approach which allows for replicate expression profiles in multiple conditions. The hierarchical mixture model accounts for differences among genes in their average expression levels, differential expression for a given gene among cell types, and measurement fluctuations. Two distinct parameterizations are considered: a model based on Gamma distributed measurements and one based on log-normally distributed measurements. False discovery rate and related operating characteristics of the methodology are assessed in a simulation study. We also show how the posterior odds of differential expression in one version of the model is related to the ratio of the arithmetic mean to the geometric mean of the two sample means. The methodology is used in a study of mammary cancer in the rat, where four distinct patterns of expression are possible. Copyright (C) 2003 John Wiley Sons, Ltd.
引用
收藏
页码:3899 / 3914
页数:16
相关论文
共 24 条
  • [1] A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes
    Baldi, P
    Long, AD
    [J]. BIOINFORMATICS, 2001, 17 (06) : 509 - 519
  • [2] Carlin B. P., 2001, BAYES EMPIRICAL BAYE
  • [3] Chen Y, 1997, J Biomed Opt, V2, P364, DOI 10.1117/12.281504
  • [4] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [5] Dudoit S, 2002, STAT SINICA, V12, P111
  • [6] EFRON B, 1973, J ROY STAT SOC B MET, V35, P379
  • [7] STEINS PARADOX IN STATISTICS
    EFRON, B
    MORRIS, C
    [J]. SCIENTIFIC AMERICAN, 1977, 236 (05) : 119 - 127
  • [8] Empirical Bayes analysis of a microarray experiment
    Efron, B
    Tibshirani, R
    Storey, JD
    Tusher, V
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) : 1151 - 1160
  • [9] EFRON B, 2000, 37B213 STANF U DEP S
  • [10] BAYES FACTORS
    KASS, RE
    RAFTERY, AE
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (430) : 773 - 795