Improved statistical inference from DNA microarray data using analysis of variance and a Bayesian statistical framework -: Analysis of global gene expression in Escherichia coli K12

被引:291
作者
Long, AD [1 ]
Mangalam, HJ
Chan, BYP
Tolleri, L
Hatfield, GW
Baldi, P
机构
[1] Univ Calif Irvine, Sch Biol Sci, Dept Ecol & Evolutionary Biol, Irvine, CA 92697 USA
[2] Univ Calif Irvine, Dept Informat & Comp Sci, Irvine, CA 92697 USA
[3] Univ Calif Irvine, Dept Microbiol & Mol Genet, Irvine, CA 92697 USA
[4] Univ Calif Irvine, Coll Med, Dept Biol Chem, Irvine, CA 92697 USA
[5] Univ Calif Irvine, Sch Engn, Dept Chem & Biochem Engn, Irvine, CA 92697 USA
[6] Natl Ctr Genome Resources, Santa Fe, NM USA
[7] Tacg Informat, Irvine, CA 92612 USA
关键词
D O I
10.1074/jbc.M010192200
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We describe statistical methods based on the t test that can be conveniently used on high density array data to test for statistically significant differences between treatments. These t tests employ either the observed variance among replicates within treatments or a Bayesian estimate of the variance among replicates within treatments based on a prior estimate obtained from a local estimate of the standard deviation. The Bayesian prior allows statistical inference to be made from microarray data even when experiments are only replicated at nominal levels. We apply these new statistical tests to a data set that examined differential gene expression patterns in IHF+ and IHF- Escherichia coli cells (Arfin, S, M., Long, A. D., Ito, E. T., Tolleri, L., Riehle, M. M., Paegle, E. S., and Hatfield, G. W. (2000) J, Biol. Chem, 275, 29672-29684), These analyses identify a more biologically reasonable set of candidate genes than those identified using statistical tests not incorporating a Bayesian prior. We also show that statistical tests based on analysis of variance and a Bayesian prior identify genes that are up or down-regulated following an experimental manipulation more reliably than approaches based only on a t test or fold change. All the described tests are implemented in a simple-to-use web interface called Cyber-T that is located on the University of California at Irvine genomics web site.
引用
收藏
页码:19937 / 19944
页数:8
相关论文
共 22 条
  • [1] Global gene expression profiling in Escherichia coli K12 -: The effects of integration host factor
    Arfin, SM
    Long, AD
    Ito, ET
    Tolleri, L
    Riehle, MM
    Paegle, ES
    Hatfield, GW
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 2000, 275 (38) : 29672 - 29684
  • [2] A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes
    Baldi, P
    Long, AD
    [J]. BIOINFORMATICS, 2001, 17 (06) : 509 - 519
  • [3] Baldi P., 1998, Bioinformatics: The machine learning approach
  • [4] BOX GEP, 1992, BAYESIAN INFERENCE S, P92
  • [5] DeRisi J, 1996, NAT GENET, V14, P457
  • [6] Exploring the metabolic and genetic control of gene expression on a genomic scale
    DeRisi, JL
    Iyer, VR
    Brown, PO
    [J]. SCIENCE, 1997, 278 (5338) : 680 - 686
  • [7] LIGHT-DIRECTED, SPATIALLY ADDRESSABLE PARALLEL CHEMICAL SYNTHESIS
    FODOR, SPA
    READ, JL
    PIRRUNG, MC
    STRYER, L
    LU, AT
    SOLAS, D
    [J]. SCIENCE, 1991, 251 (4995) : 767 - 773
  • [8] HYBRIDIZATION FINGERPRINTING OF HIGH-DENSITY CDNA-LIBRARY ARRAYS WITH CDNA POOLS DERIVED FROM WHOLE TISSUES
    GRESS, TM
    HOHEISEL, JD
    LENNON, GG
    ZEHETNER, G
    LEHRACH, H
    [J]. MAMMALIAN GENOME, 1992, 3 (11) : 609 - 619
  • [9] Discovery and analysis of inflammatory disease-related genes using cDNA microarrays
    Heller, RA
    Schena, M
    Chai, A
    Shalon, D
    Bedilion, T
    Gilmore, J
    Woolley, DE
    Davis, RW
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (06) : 2150 - 2155
  • [10] Yeast microarrays for genome wide parallel genetic and gene expression analysis
    Lashkari, DA
    DeRisi, JL
    McCusker, JH
    Namath, AF
    Gentile, C
    Hwang, SY
    Brown, PO
    Davis, RW
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (24) : 13057 - 13062