Tracing the decay of the historical signal in biological sequence data

Cited by: 120
Authors
Ho, SYW
Jermiin, LS
Affiliations
[1] Univ Sydney, Sch Biol Sci, Sydney, NSW 2006, Australia
[2] Univ Sydney, Sydney Univ Biol Informat & Technol Ctr, Sydney, NSW 2006, Australia
Funding
Australian Research Council
Keywords
compositional heterogeneity; edge lengths; Monte Carlo simulation; networks; phylogenetic signal; rate heterogeneity; substitutional saturation
DOI
10.1080/10635150490503035
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Alignments of nucleotide or amino acid sequences may contain a variety of different signals, one of which is the historical signal that we often try to recover by phylogenetic analysis. Other signals, such as those arising due to compositional heterogeneities, among-lineage and among-site rate heterogeneities, invariant sites, and covariotides, may interfere adversely with the recovery of the historical signal. The effect of the interaction of these signals on phylogenetic inference is not well understood and may, in many cases, even be underappreciated. In this study, we investigate this matter and present results based on Monte Carlo simulations. We explored the success of four phylogenetic methods in recovering the true tree from data that had evolved under conditions where the equilibrium base frequencies and substitution rates were allowed to vary among lineages. Seven scenarios with increasingly complex conditions were investigated. All of the methods tested, with the exception of neighbor-joining using LogDet distances, were sensitive to compositional convergence in nonsister lineages. Maximum parsimony was also susceptible to attraction between long edges. In many cases, however, phylogenetic inference methods can still recover the true tree when misleading signals are present, in some instances even when the historical signal is no longer dominant. These results highlight the growing need for simple methods to detect violation of the phylogenetic assumptions.
Pages: 623-637
Page count: 15