Compositional biases of bacterial genomes and evolutionary implications

Cited by: 324
Authors
Karlin, S [1]
Mrazek, J [1]
Campbell, AM [1]
Affiliations
[1] Stanford Univ, Dept Biol Sci, Stanford, CA 94305
DOI
10.1128/jb.179.12.3899-3913.1997
Chinese Library Classification number
Q93 [Microbiology]
Discipline classification codes
071005; 100705
Abstract
We compare and contrast genome-wide compositional biases and distributions of short oligonucleotides across 15 diverse prokaryotes that have substantial genomic sequence collections. These include seven complete genomes (Escherichia coli, Haemophilus influenzae, Mycoplasma genitalium, Mycoplasma pneumoniae, Synechocystis sp. strain PCC6803, Methanococcus jannaschii, and Pyrobaculum aerophilum). A key observation concerns the constancy of the dinucleotide relative abundance profiles over multiple 50-kb disjoint contigs within the same genome. (The profile is rho(XY)* = f(XY)*/[f(X)* f(Y)*] for all XY, where f(X)* denotes the frequency of the nucleotide X and f(XY)* denotes the frequency of the dinucleotide XY, both computed from the sequence concatenated with its inverted complementary sequence.) On the basis of this constancy, we refer to the collection {rho(XY)*} as the genome signature. We establish that the differences between {rho(XY)*} vectors of 50-kb sample contigs of different genomes virtually always exceed the differences between those of the same genome. Various di- and tetranucleotide biases are identified. In particular, we find that the dinucleotide CpG (CG) is underrepresented in many thermophiles (e.g., M. jannaschii, Sulfolobus sp., and M. thermoautotrophicum) but overrepresented in halobacteria. TA is broadly underrepresented in prokaryotes and eukaryotes, but normal counts appear in Sulfolobus and P. aerophilum sequences. More than for any other bacterial genome, palindromic tetranucleotides are underrepresented in H. influenzae. The M. jannaschii sequence is unprecedented in its extreme underrepresentation of CTAG tetranucleotides and in the anomalous distribution of CTAG sites around the genome. Comparative analysis of numbers of long tetranucleotide microsatellites distinguishes H. influenzae.
Dinucleotide relative abundance differences between bacterial sequences are compared. For example, in these assessments of differences, the cyanobacteria Synechocystis, Synechococcus, and Anabaena do not form a coherent group and are as far from each other as general gram-negative sequences are from general gram-positive sequences. The difference of M. jannaschii from low-G+C gram-positive proteobacteria is one-half of the difference from gram-negative proteobacteria. Interpretations and hypotheses center on the role of the genome signature in highlighting similarities and dissimilarities across different classes of prokaryotic species, possible mechanisms underlying the genome signature, the form and level of genome compositional flux, the use of the genome signature as a chronometer of molecular phylogeny, and implications with respect to the three putative eubacterial, archaeal, and eukaryote domains of life and to the origin and early evolution of eukaryotes.
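The genome signature defined in the abstract can be illustrated with a short Python sketch. It computes rho(XY)* from one strand plus its reverse complement, following the formula given above. The function names (`genome_signature`, `signature_distance`) are hypothetical, counting the two strands separately rather than literally concatenating them is a simplification that avoids one artificial junction dinucleotide, and the distance shown (mean absolute difference over the 16 dinucleotides) is an assumed measure, since the abstract does not state which one the authors use.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the inverted complementary strand of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def genome_signature(seq):
    """Return {XY: rho(XY)*} for all 16 dinucleotides.

    Frequencies are pooled over the sequence and its reverse complement.
    Symbols other than A, C, G, T are ignored in the totals.
    """
    seq = seq.upper()
    strands = (seq, reverse_complement(seq))

    mono = Counter()
    di = Counter()
    for strand in strands:
        mono.update(strand)
        di.update(strand[i:i + 2] for i in range(len(strand) - 1))

    n_mono = sum(mono[b] for b in BASES)
    n_di = sum(di[x + y] for x, y in product(BASES, repeat=2))

    signature = {}
    for x, y in product(BASES, repeat=2):
        fx = mono[x] / n_mono
        fy = mono[y] / n_mono
        fxy = di[x + y] / n_di
        # rho is undefined when either nucleotide is absent from the sample
        signature[x + y] = fxy / (fx * fy) if fx and fy else float("nan")
    return signature


def signature_distance(sig_a, sig_b):
    """Mean absolute difference between two signatures (assumed measure)."""
    return sum(abs(sig_a[k] - sig_b[k]) for k in sig_a) / len(sig_a)
```

A value of rho(XY)* near 1 means the dinucleotide occurs about as often as its base composition predicts; values well below or above 1 correspond to the under- and overrepresentation discussed in the abstract. Because both strands are pooled, each dinucleotide and its reverse complement (for example AC and GT) receive the same value.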
Pages: 3899-3913
Page count: 15