Bayesian models for pooling microarray studies with multiple sources of replications

被引:36
作者
Conlon, Erin M. [1 ]
Song, Joon J.
Liu, Jun S.
机构
[1] Univ Massachusetts, Dept Math & Stat, Amherst, MA 01003 USA
[2] Harvard Univ, Dept Stat, Cambridge, MA 02138 USA
关键词
D O I
10.1186/1471-2105-7-247
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Biologists often conduct multiple but different cDNA microarray studies that all target the same biological system or pathway. Within each study, replicate slides within repeated identical experiments are often produced. Pooling information across studies can help more accurately identify true target genes. Here, we introduce a method to integrate multiple independent studies efficiently. Results: We introduce a Bayesian hierarchical model to pool cDNA microarray data across multiple independent studies to identify highly expressed genes. Each study has multiple sources of variation, i.e. replicate slides within repeated identical experiments. Our model produces the genespecific posterior probability of differential expression, which provides a direct method for ranking genes, and provides Bayesian estimates of false discovery rates (FDR). In simulations combining two and five independent studies, with fixed FDR levels, we observed large increases in the number of discovered genes in pooled versus individual analyses. When the number of output genes is fixed ( e. g., top 100), the pooled model found appreciably more truly differentially expressed genes than the individual studies. We were also able to identify more differentially expressed genes from pooling two independent studies in Bacillus subtilis than from each individual data set. Finally, we observed that in our simulation studies our Bayesian FDR estimates tracked the true FDRs very well. Conclusion: Our method provides a cohesive framework for combining multiple but not identical microarray studies with several sources of replication, with data produced from the same platform. We assume that each study contains only two conditions: an experimental and a control sample. We demonstrated our model's suitability for a small number of studies that have been either pre-scaled or have no outliers.
引用
收藏
页数:13
相关论文
共 44 条
[1]   A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes [J].
Baldi, P ;
Long, AD .
BIOINFORMATICS, 2001, 17 (06) :509-519
[2]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[3]   Bayesian hierarchical model for identifying changes in gene expression from microarray experiments [J].
Broët, P ;
Richardson, S ;
Radvanyi, F .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (04) :671-683
[4]   Combining multiple microarray studies and modeling interstudy variation [J].
Choi, Jung Kyoon ;
Yu, Ungsik ;
Kim, Sangsoo ;
Yoo, Ook Joon .
BIOINFORMATICS, 2003, 19 :i84-i90
[5]   Determining and analyzing differentially expressed genes from cDNA microarray experiments with complementary designs [J].
Conlon, EM ;
Eichenberger, P ;
Liu, JS .
JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 90 (01) :1-18
[6]   A Bayesian mixture model for differential gene expression [J].
Do, KA ;
Müller, P ;
Tang, F .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2005, 54 :627-644
[7]  
Dudoit S, 2002, STAT SINICA, V12, P111
[8]   BAYES METHODS FOR COMBINING THE RESULTS OF CANCER STUDIES IN HUMANS AND OTHER SPECIES [J].
DUMOUCHEL, WH ;
HARRIS, JE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1983, 78 (382) :293-308
[9]   Empirical Bayes analysis of a microarray experiment [J].
Efron, B ;
Tibshirani, R ;
Storey, JD ;
Tusher, V .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1151-1160
[10]   The σΕ regulon and the identification of additional sporulation genes in Bacillus subtilis [J].
Eichenberger, P ;
Jensen, ST ;
Conlon, EM ;
van Ooij, C ;
Silvaggi, J ;
González-Pastor, JE ;
Fujita, M ;
Ben-Yehuda, S ;
Stragier, P ;
Liu, JS ;
Losick, R .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 327 (05) :945-972