Inferring and Validating Horizontal Gene Transfer Events Using Bipartition Dissimilarity

被引:60
作者
Boc, Alix [1 ]
Philippe, Herve [2 ]
Makarenkov, Vladimir [1 ]
机构
[1] Univ Quebec, Dept Informat, Montreal, PQ H3C 3P8, Canada
[2] Univ Montreal, Dept Biochim, Fac Med, Montreal, PQ H3C 3J7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Bipartition dissimilarity; bootstrap analysis; horizontal gene transfer; least squares; phylogenetic tree; quartet distance; Robinson and Foulds topological distance; HISTORICAL ASSOCIATIONS; PHYLOGENETIC NETWORKS; MAXIMUM-LIKELIHOOD; SEQUENCE EVOLUTION; TREES; GENOMES; RATES; SUBSTITUTION; ALGORITHMS; SIMULATION;
D O I
10.1093/sysbio/syp103
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Horizontal gene transfer (HGT) is one of the main mechanisms driving the evolution of microorganisms. Its accurate identification is one of the major challenges posed by reticulate evolution. In this article, we describe a new polynomial-time algorithm for inferring HGT events and compare 3 existing and 1 new tree comparison indices in the context of HGT identification. The proposed algorithm can rely on different optimization criteria, including least squares (LS), Robinson and Foulds (RF) distance, quartet distance (QD), and bipartition dissimilarity (BD), when searching for an optimal scenario of subtree prune and regraft (SPR) moves needed to transform the given species tree into the given gene tree. As the simulation results suggest, the algorithmic strategy based on BD, introduced in this article, generally provides better results than those based on LS, RF, and QD. The BD-based algorithm also proved to be more accurate and faster than a well-known polynomial time heuristic RIATA-HGT. Moreover, the HGT recovery results yielded by BD were generally equivalent to those provided by the exponential-time algorithm LatTrans, but a clear gain in running time was obtained using the new algorithm. Finally, a statistical framework for assessing the reliability of obtained HGTs by bootstrap analysis is also presented.
引用
收藏
页码:195 / 211
页数:17
相关论文
共 55 条
  • [21] On the complexity of comparing evolutionary trees
    Hein, J
    Jiang, T
    Wang, LS
    Zhang, KZ
    [J]. DISCRETE APPLIED MATHEMATICS, 1996, 71 (1-3) : 153 - 169
  • [22] Hickey G, 2008, EVOL BIOINFORM, V4, P17
  • [23] Inferring phylogenetic networks by the maximum parsimony criterion: A case study
    Jin, Guohua
    Nakhleh, Luay
    Snir, Sagi
    Tuller, Tamir
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (01) : 324 - 337
  • [24] JIN KW, 2006, J HIGHWAY TRANSPORTA, V23, P128
  • [25] THE RAPID GENERATION OF MUTATION DATA MATRICES FROM PROTEIN SEQUENCES
    JONES, DT
    TAYLOR, WR
    THORNTON, JM
    [J]. COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1992, 8 (03): : 275 - 282
  • [26] Horizontal gene transfer: the path to maturity
    Koonin, EV
    [J]. MOLECULAR MICROBIOLOGY, 2003, 50 (03) : 725 - 727
  • [27] KUHNER MK, 1994, MOL BIOL EVOL, V11, P459
  • [28] Amelioration of bacterial genomes: Rates of change and exchange
    Lawrence, JG
    Ochman, H
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 1997, 44 (04) : 383 - 397
  • [29] Deduction of probable events of lateral gene transfer through comparison of phylogenetic trees by recursive consolidation and rearrangement
    MacLeod, D
    Charlebois, RL
    Doolittle, F
    Bapteste, E
    [J]. BMC EVOLUTIONARY BIOLOGY, 2005, 5 (1)
  • [30] MADDISON D.R. A. K.-S. S. E., 2004, TREE LIFE WEB PROJEC