Predicting gene expression from sequence

被引:449
作者
Beer, MA
Tavazoie, S [1 ]
机构
[1] Princeton Univ, Lewis Sigler Inst Integrat Genom, Princeton, NJ 08544 USA
[2] Princeton Univ, Dept Mol Biol, Princeton, NJ 08544 USA
基金
美国国家科学基金会;
关键词
D O I
10.1016/S0092-8674(04)00304-6
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We describe a systematic genome-wide approach for learning the complex combinatorial code underlying gene expression. Our probabilistic approach identifies local DNA-sequence elements and the positional and combinatorial constraints that determine their context-dependent role in transcriptional regulation. The inferred regulatory rules correctly predict expression patterns for 73% of genes in Saccharomyces cerevisiae, utilizing microarray expression data and sequences in the 800 bp upstream of genes. Application to Caenorhabditis elegans identifies predictive regulatory elements and combinatorial rules that control the phased temporal expression of transcription factors, histones, and germline specific genes. Successful prediction requires diverse and complex rules utilizing AND, OR, and NOT logic, with significant constraints on motif strength, orientation, and relative position. This system generates a large number of mechanistic hypotheses for focused experimental validation, and establishes a predictive dynamical framework for understanding cellular behavior from genomic sequence.
引用
收藏
页码:185 / 198
页数:14
相关论文
共 50 条
  • [1] SKN-1 links C-elegans mesendodermal specification to a conserved oxidative stress response
    An, JH
    Blackwell, TK
    [J]. GENES & DEVELOPMENT, 2003, 17 (15) : 1882 - 1893
  • [2] MODIFIERS OF POSITION EFFECT ARE SHARED BETWEEN TELOMERIC AND SILENT MATING-TYPE LOCI IN SACCHAROMYCES-CEREVISIAE
    APARICIO, OM
    BILLINGTON, BL
    GOTTSCHLING, DE
    [J]. CELL, 1991, 66 (06) : 1279 - 1287
  • [3] Composition and dynamics of the Caenorhabditis elegans early embryonic transcriptome
    Baugh, LR
    Hill, AA
    Slonim, DK
    Brown, EL
    Hunter, CP
    [J]. DEVELOPMENT, 2003, 130 (05): : 889 - 900
  • [4] FORMATION OF A MONOMERIC DNA-BINDING DOMAIN BY SKN-1 BZIP AND HOMEODOMAIN ELEMENTS
    BLACKWELL, TK
    BOWERMAN, B
    PRIESS, JR
    WEINTRAUB, H
    [J]. SCIENCE, 1994, 266 (5185) : 621 - 628
  • [5] SKN-1, A MATERNALLY EXPRESSED GENE REQUIRED TO SPECIFY THE FATE OF VENTRAL BLASTOMERES IN THE EARLY C-ELEGANS EMBRYO
    BOWERMAN, B
    EATON, BA
    PRIESS, JR
    [J]. CELL, 1992, 68 (06) : 1061 - 1075
  • [6] Bagging predictors
    Breiman, L
    [J]. MACHINE LEARNING, 1996, 24 (02) : 123 - 140
  • [7] Regulatory element detection using correlation with expression
    Bussemaker, HJ
    Li, H
    Siggia, ED
    [J]. NATURE GENETICS, 2001, 27 (02) : 167 - 171
  • [8] A computational analysis of whole-genome expression data reveals chromosomal domains of gene expression
    Cohen, BA
    Mitra, RD
    Hughes, JD
    Church, GM
    [J]. NATURE GENETICS, 2000, 26 (02) : 183 - 186
  • [9] COOPER GF, 1992, MACH LEARN, V9, P309, DOI 10.1007/BF00994110
  • [10] Regulatory gene networks and the properties of the developmental process
    Davidson, EH
    McCay, DR
    Hood, L
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (04) : 1475 - 1480