Improved statistical inference from DNA microarray data using analysis of variance and a Bayesian statistical framework: Analysis of global gene expression in Escherichia coli K12

Cited by: 291
Authors
Long, AD [1]
Mangalam, HJ
Chan, BYP
Tolleri, L
Hatfield, GW
Baldi, P
Affiliations
[1] Univ Calif Irvine, Sch Biol Sci, Dept Ecol & Evolutionary Biol, Irvine, CA 92697 USA
[2] Univ Calif Irvine, Dept Informat & Comp Sci, Irvine, CA 92697 USA
[3] Univ Calif Irvine, Dept Microbiol & Mol Genet, Irvine, CA 92697 USA
[4] Univ Calif Irvine, Coll Med, Dept Biol Chem, Irvine, CA 92697 USA
[5] Univ Calif Irvine, Sch Engn, Dept Chem & Biochem Engn, Irvine, CA 92697 USA
[6] Natl Ctr Genome Resources, Santa Fe, NM USA
[7] Tacg Informat, Irvine, CA 92612 USA
DOI
10.1074/jbc.M010192200
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification code
071010; 081704;
Abstract
We describe statistical methods based on the t test that can be conveniently used on high density array data to test for statistically significant differences between treatments. These t tests employ either the observed variance among replicates within treatments or a Bayesian estimate of the variance among replicates within treatments based on a prior estimate obtained from a local estimate of the standard deviation. The Bayesian prior allows statistical inference to be made from microarray data even when experiments are only replicated at nominal levels. We apply these new statistical tests to a data set that examined differential gene expression patterns in IHF+ and IHF- Escherichia coli cells (Arfin, S. M., Long, A. D., Ito, E. T., Tolleri, L., Riehle, M. M., Paegle, E. S., and Hatfield, G. W. (2000) J. Biol. Chem. 275, 29672-29684). These analyses identify a more biologically reasonable set of candidate genes than those identified using statistical tests not incorporating a Bayesian prior. We also show that statistical tests based on analysis of variance and a Bayesian prior identify genes that are up- or down-regulated following an experimental manipulation more reliably than approaches based only on a t test or fold change. All the described tests are implemented in a simple-to-use web interface called Cyber-T that is located on the University of California at Irvine genomics web site.
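The idea of a t test whose variance is stabilized by a prior can be illustrated with a minimal sketch. The code below is not the Cyber-T implementation: the weighted-average form of the variance estimate, the function names, and the `prior_weight` parameter are all assumptions made for illustration, and the prior variance is simply passed in rather than computed from a local estimate of the standard deviation.

```python
import math
from statistics import mean, variance


def regularized_variance(sample_var, n, prior_var, prior_weight):
    """Blend the observed variance with a prior variance.

    Assumed form (illustrative only): a weighted average in which the
    prior counts as `prior_weight` pseudo-observations and the sample
    variance carries its n - 1 degrees of freedom.
    """
    return (prior_weight * prior_var + (n - 1) * sample_var) / (
        prior_weight + n - 1
    )


def regularized_t(x, y, prior_var_x, prior_var_y, prior_weight):
    """Two-sample t statistic using the regularized variances.

    With prior_weight = 0 this reduces to the ordinary unequal-variance
    t statistic computed from the sample variances alone.
    """
    nx, ny = len(x), len(y)
    vx = regularized_variance(variance(x), nx, prior_var_x, prior_weight)
    vy = regularized_variance(variance(y), ny, prior_var_y, prior_weight)
    return (mean(x) - mean(y)) / math.sqrt(vx / nx + vy / ny)


# Two replicates per treatment with identical values: the sample variance
# is zero, so an ordinary t statistic would divide by zero, while the
# prior keeps the denominator finite.
control = [1.0, 1.0]
treated = [2.0, 2.0]
t_stat = regularized_t(control, treated, 0.25, 0.25, prior_weight=2)
```

In this sketch a larger `prior_weight` pulls the variance estimate toward the prior, which matters most when only two or three replicates are available and the sample variance alone is unreliable.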
Pages: 19937-19944
Page count: 8