Genome trees constructed using five different approaches suggest new major bacterial clades

被引:238
作者
Wolf, Yuri I. [1 ]
Rogozin, Igor B. [1 ]
Grishin, Nick V. [2 ,3 ]
Tatusov, Roman L. [1 ]
Koonin, Eugene V. [1 ]
机构
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
[2] Univ Texas SW Med Ctr Dallas, Howard Hughes Med Inst, Dallas, TX 75390 USA
[3] Univ Texas SW Med Ctr Dallas, Dept Biochem, Dallas, TX 75390 USA
关键词
D O I
10.1186/1471-2148-1-8
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The availability of multiple complete genome sequences from diverse taxa prompts the development of new phylogenetic approaches, which attempt to incorporate information derived from comparative analysis of complete gene sets or large subsets thereof. Such attempts are particularly relevant because of the major role of horizontal gene transfer and lineage-specific gene loss, at least in the evolution of prokaryotes. Results: Five largely independent approaches were employed to construct trees for completely sequenced bacterial and archaeal genomes: i) presence-absence of genomes in clusters of orthologous genes; ii) conservation of local gene order (gene pairs) among prokaryotic genomes; iii) parameters of identity distribution for probable orthologs; iv) analysis of concatenated alignments of ribosomal proteins; v) comparison of trees constructed for multiple protein families. All constructed trees support the separation of the two primary prokaryotic domains, bacteria and archaea, as well as some terminal bifurcations within the bacterial and archaeal domains. Beyond these obvious groupings, the trees made with different methods appeared to differ substantially in terms of the relative contributions of phylogenetic relationships and similarities in gene repertoires caused by similar life styles and horizontal gene transfer to the tree topology. The trees based on presence-absence of genomes in orthologous clusters and the trees based on conserved gene pairs appear to be strongly affected by gene loss and horizontal gene transfer. The trees based on identity distributions for orthologs and particularly the tree made of concatenated ribosomal protein sequences seemed to carry a stronger phylogenetic signal. The latter tree supported three potential high-level bacterial clades,: i) Chlamydia-Spirochetes, ii) Thermotogales-Aquificales (bacterial hyperthermophiles), and ii) Actinomycetes-Deinococcales-Cyanobacteria. The latter group also appeared to join the low-GC Gram-positive bacteria at a deeper tree node. These new groupings of bacteria were supported by the analysis of alternative topologies in the concatenated ribosomal protein tree using the Kishino-Hasegawa test and by a census of the topologies of 132 individual groups of orthologous proteins. Additionally, the results of this analysis put into question the sister-group relationship between the two major archaeal groups, Euryarchaeota and Crenarchaeota, and suggest instead that Euryarchaeota might be a paraphyletic group with respect to Crenarchaeota. Conclusions: We conclude that, the extensive horizontal gene flow and lineage-specific gene loss notwithstanding, extension of phylogenetic analysis to the genome scale has the potential of uncovering deep evolutionary relationships between prokaryotic lineages.
引用
收藏
页数:22
相关论文
共 52 条
  • [1] ADACHI J, 1992, MOLPHY PROGRAMS MOL
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] Evidence for massive gene exchange between archaeal and bacterial hyperthermophiles
    Aravind, L
    Tatusov, RL
    Wolf, YI
    Walker, DR
    Koonin, EV
    [J]. TRENDS IN GENETICS, 1998, 14 (11) : 442 - 444
  • [4] The root of the universal tree and the origin of eukaryotes based on elongation factor phylogeny
    Baldauf, SL
    Palmer, JD
    Doolittle, WF
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (15) : 7749 - 7754
  • [5] The evolutionary history of ribosomal protein RpS14: horizontal gene transfer at the heart of the ribosome
    Brochier, C
    Philippe, H
    Moreira, D
    [J]. TRENDS IN GENETICS, 2000, 16 (12) : 529 - 533
  • [6] Universal trees based on large combined protein sequence data sets
    Brown, JR
    Douady, CJ
    Italia, MJ
    Marshall, WE
    Stanhope, MJ
    [J]. NATURE GENETICS, 2001, 28 (03) : 281 - 285
  • [7] Conservation of gene order: a fingerprint of proteins that physically interact
    Dandekar, T
    Snel, B
    Huynen, M
    Bork, P
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (09) : 324 - 328
  • [8] Dobzhansky T, 1938, GENETICS, V23, P28
  • [9] Determining divergence times of the major kingdoms of living organisms with a protein clock
    Doolittle, RF
    Feng, DF
    Tsang, S
    Cho, G
    Little, E
    [J]. SCIENCE, 1996, 271 (5248) : 470 - 477
  • [10] Phylogenetic classification and the universal tree
    Doolittle, WF
    [J]. SCIENCE, 1999, 284 (5423) : 2124 - 2128