Phylogenetically and spatially conserved word pairs associated with gene-expression changes in yeasts

Cited by: 43
Authors
Chiang, DY
Moses, AM
Kellis, M
Lander, ES
Eisen, MB
Affiliations
[1] Univ Calif Berkeley, Lawrence Berkeley Lab, Div Life Sci, Dept Genome Sci, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Mol & Cell Biol, Ctr Integrat Genom, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Grad Grp Biophys, Berkeley, CA 94720 USA
[4] MIT, Dept Comp Sci, Whitehead MIT Ctr Genome Res, Cambridge, MA 02139 USA
[5] MIT, Dept Biol, Whitehead MIT Ctr Genome Res, Cambridge, MA 02139 USA
[6] Univ Calif Berkeley, Dept Mol & Cell Biol, Div Genet & Dev, Berkeley, CA 94720 USA
DOI
10.1186/gb-2003-4-7-r43
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Subject classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: Transcriptional regulation in eukaryotes often involves multiple transcription factors binding to the same transcription control region, and to understand the regulatory content of eukaryotic genomes it is necessary to consider the co-occurrence and spatial relationships of individual binding sites. The determination of conserved sequences (often known as phylogenetic footprinting) has identified individual transcription factor binding sites. We extend this concept of functional conservation to higher-order features of transcription control regions.

Results: We used the genome sequences of four yeast species of the genus Saccharomyces to identify sequences potentially involved in multifactorial control of gene expression. We found 989 potential regulatory 'templates': pairs of hexameric sequences that are jointly conserved in transcription regulatory regions and also exhibit non-random relative spacing. Many of the individual sequences in these templates correspond to known transcription factor binding sites, and the sets of genes containing a particular template in their transcription control regions tend to be differentially expressed in conditions where the corresponding transcription factors are known to be active. The incorporation of word pairs to define sequence features yields more specific predictions of average expression profiles and more informative regression models for genome-wide expression data than considering sequence conservation alone.

Conclusions: The incorporation of both joint conservation and spacing constraints of sequence pairs predicts groups of target genes that are specific for common patterns of gene expression. Our work suggests that positional information, especially the relative spacing between transcription factor binding sites, may represent a common organizing principle of transcription control regions.
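The notion of a 'template' in the abstract (two hexamers that co-occur across species and sit at a constrained distance from each other) can be illustrated with a small sketch. This is not the authors' pipeline: the function names, the toy sequences, the two example words, and the simple "present in every species" test are all assumptions made for illustration, and the sketch omits alignment and any statistical test of spacing.

```python
from collections import defaultdict


def word_positions(seq, k=6):
    """Map each k-mer in seq to the list of its 0-based start positions."""
    positions = defaultdict(list)
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i)
    return positions


def pair_spacings(seq, word_a, word_b, k=6):
    """Start-to-start distances between non-overlapping occurrences of two words."""
    pos = word_positions(seq, k)
    return sorted(
        abs(a - b)
        for a in pos.get(word_a, [])
        for b in pos.get(word_b, [])
        if abs(a - b) >= k  # skip overlapping occurrences
    )


def jointly_present(orthologous_promoters, word_a, word_b):
    """True if both words occur in every species' promoter (a crude conservation proxy)."""
    return all(word_a in s and word_b in s for s in orthologous_promoters)


# Hypothetical toy promoters, one per species; invented for this example.
toy_promoters = [
    "TTCACGTGAAAAACTGTGGCTT",
    "CACGTGTTTTTCTGTGG",
]

if __name__ == "__main__":
    if jointly_present(toy_promoters, "CACGTG", "CTGTGG"):
        for s in toy_promoters:
            print(pair_spacings(s, "CACGTG", "CTGTGG"))
```

A word pair whose spacings cluster around the same value across many promoters would be a candidate for the non-random relative spacing described in the abstract; deciding what counts as non-random needs a background model that this sketch does not attempt.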
Pages: 19