Algorithms for computing parsimonious evolutionary scenarios for genome evolution, the last universal common ancestor and dominance of horizontal gene transfer in the evolution of prokaryotes

被引:267
作者
Mirkin, BG
Fenner, TI
Galperin, MY
Koonin, EV [1 ]
机构
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
[2] Univ London Birkbeck Coll, Sch Informat Syst & Comp Sci, London WC1E 7HX, England
关键词
D O I
10.1186/1471-2148-3-2
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Comparative analysis of sequenced genomes reveals numerous instances of apparent horizontal gene transfer (HGT), at least in prokaryotes, and indicates that lineage-specific gene loss might have been even more common in evolution. This complicates the notion of a species tree, which needs to be re-interpreted as a prevailing evolutionary trend, rather than the full depiction of evolution, and makes reconstruction of ancestral genomes a non-trivial task. Results: We addressed the problem of constructing parsimonious scenarios for individual sets of orthologous genes given a species tree. The orthologous sets were taken from the database of Clusters of Orthologous Groups of proteins (COGs). We show that the phyletic patterns ( patterns of presence-absence in completely sequenced genomes) of almost 90% of the COGs are inconsistent with the hypothetical species tree. Algorithms were developed to reconcile the phyletic patterns with the species tree by postulating gene loss, COG emergence and HGT (the latter two classes of events were collectively treated as gene gains). We prove that each of these algorithms produces a parsimonious evolutionary scenario, which can be represented as mapping of loss and gain events on the species tree. The distribution of the evolutionary events among the tree nodes substantially depends on the underlying assumptions of the reconciliation algorithm, e. g. whether or not independent gene gains (gain after loss after gain) are permitted. Biological considerations suggest that, on average, gene loss might be a more likely event than gene gain. Therefore different gain penalties were used and the resulting series of reconstructed gene sets for the last universal common ancestor (LUCA) of the extant life forms were analysed. The number of genes in the reconstructed LUCA gene sets grows as the gain penalty increases. However, qualitative examination of the LUCA versions reconstructed with different gain penalties indicates that, even with a gain penalty of 1 (equal weights assigned to a gain and a loss), the set of 572 genes assigned to LUCA might be nearly sufficient to sustain a functioning organism. Under this gain penalty value, the numbers of horizontal gene transfer and gene loss events are nearly identical. This result holds true for two alternative topologies of the species tree and even under random shuffling of the tree. Therefore, the results seem to be compatible with approximately equal likelihoods of HGT and gene loss in the evolution of prokaryotes. Conclusions: The notion that gene loss and HGT are major aspects of prokaryotic evolution was supported by quantitative analysis of the mapping of the phyletic patterns of COGs onto a hypothetical species tree. Algorithms were developed for constructing parsimonious evolutionary scenarios, which include gene loss and gain events, for orthologous gene sets, given a species tree. This analysis shows, contrary to expectations, that the number of predicted HGT events that occurred during the evolution of prokaryotes might be approximately the same as the number of gene losses. The approach to the reconstruction of evolutionary scenarios employed here is conservative with regard to the detection of HGT because only patterns of gene presence-absence in sequenced genomes are taken into account. In reality, horizontal transfer might have contributed to the evolution of many other genes also, which makes it a dominant force in prokaryotic evolution.
引用
收藏
页数:34
相关论文
共 83 条
  • [1] Evidence for massive gene exchange between archaeal and bacterial hyperthermophiles
    Aravind, L
    Tatusov, RL
    Wolf, YI
    Walker, DR
    Koonin, EV
    [J]. TRENDS IN GENETICS, 1998, 14 (11) : 442 - 444
  • [2] Trends in protein evolution inferred from sequence and structure analysis
    Aravind, L
    Mazumder, R
    Vasudevan, S
    Koonin, EV
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 2002, 12 (03) : 392 - 399
  • [3] The evolutionary history of ribosomal protein RpS14: horizontal gene transfer at the heart of the ribosome
    Brochier, C
    Philippe, H
    Moreira, D
    [J]. TRENDS IN GENETICS, 2000, 16 (12) : 529 - 533
  • [4] Eubacterial phylogeny based on translational apparatus proteins
    Brochier, C
    Bapteste, E
    Moreira, D
    Philippe, H
    [J]. TRENDS IN GENETICS, 2002, 18 (01) : 1 - 5
  • [5] Archaea and the prokaryote-to-eukaryote transition
    Brown, JR
    Doolittle, WF
    [J]. MICROBIOLOGY AND MOLECULAR BIOLOGY REVIEWS, 1997, 61 (04) : 456 - +
  • [6] Universal trees based on large combined protein sequence data sets
    Brown, JR
    Douady, CJ
    Italia, MJ
    Marshall, WE
    Stanhope, MJ
    [J]. NATURE GENETICS, 2001, 28 (03) : 281 - 285
  • [7] Inferring genome trees by using a filter to eliminate phylogenetically discordant sequences and a distance matrix based on mean normalized BLASTP scores
    Clarke, GDP
    Beiko, RG
    Ragan, MA
    Charlebois, RL
    [J]. JOURNAL OF BACTERIOLOGY, 2002, 184 (08) : 2072 - 2080
  • [8] Conservation of gene order: a fingerprint of proteins that physically interact
    Dandekar, T
    Snel, B
    Huynen, M
    Bork, P
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (09) : 324 - 328
  • [9] The universal ancestor was a thermophile or a hyperthermophile
    Di Giulio, M
    [J]. GENE, 2001, 281 (1-2) : 11 - 17
  • [10] DNA repair systems in Archaea: Mementos from the last universal common ancestor?
    DiRuggiero, J
    Brown, JR
    Bogert, AP
    Robb, FT
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 1999, 49 (04) : 474 - 484