The effects of selection against spurious transcription factor binding sites

Cited by: 70
Authors
Hahn, MW [1 ]
Stajich, JE [1 ]
Wray, GA [1 ]
Affiliations
[1] Duke Univ, Dept Biol, Durham, NC 27706 USA
Keywords
comparative genomics; natural selection; motif bias; promoters
DOI
10.1093/molbev/msg096
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification code
071010; 081704
Abstract
Most genomes contain nucleotide sequences with no known function; such sequences are assumed to be free of constraints, evolving only according to the vagaries of mutation. Here we show that selection acts to remove spurious transcription factor binding site motifs throughout 52 fully sequenced genomes of Eubacteria and Archaea. Examining the sequences necessary for polymerase binding, we find that spurious binding sites are underrepresented in both coding and noncoding regions. The average proportion of spurious binding sites found relative to the expected is 80% in eubacterial genomes and 89% in archaeal genomes. We also estimate the strength of selection against spurious binding sites in the face of the constant creation of new binding sites via mutation. Under conservative assumptions, we estimate that selection is weak, with the average efficacy of selection against spurious binding sites, N_e s (the product of effective population size and selection coefficient), of -0.12 for eubacterial genomes and -0.06 for archaeal genomes, similar to that of codon bias. Our results suggest that both coding and noncoding sequences are constrained by selection to avoid specific regions of sequence space.
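
The observed-to-expected proportion quoted in the abstract can be illustrated with a minimal sketch. The abstract does not state how the expected count was derived, so the null model below (independent bases drawn at the sequence's own nucleotide frequencies) is an assumption made purely for illustration, and the function names are hypothetical rather than taken from the paper.

```python
from collections import Counter


def count_occurrences(sequence: str, motif: str) -> int:
    """Count occurrences of motif in sequence, allowing overlaps."""
    k = len(motif)
    return sum(
        1 for i in range(len(sequence) - k + 1) if sequence[i:i + k] == motif
    )


def expected_occurrences(sequence: str, motif: str) -> float:
    """Expected motif count under an ASSUMED null model: each base is
    independent and drawn at its observed frequency in the sequence."""
    n, k = len(sequence), len(motif)
    if n < k:
        return 0.0
    base_counts = Counter(sequence)
    probability = 1.0
    for base in motif:
        probability *= base_counts[base] / n
    # There are n - k + 1 windows in which the motif could start.
    return probability * (n - k + 1)


def observed_to_expected(sequence: str, motif: str) -> float:
    """Ratio of observed to expected motif counts; values below 1
    indicate that the motif is underrepresented."""
    expected = expected_occurrences(sequence, motif)
    if expected == 0:
        raise ValueError("expected count is zero; ratio is undefined")
    return count_occurrences(sequence, motif) / expected
```

Under this sketch, a ratio of 0.80 would correspond to the 80% figure reported for eubacterial genomes; a real analysis would need a more careful null model and motif definition than the one assumed here.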
Pages: 901-906
Page count: 6