Effects of nucleotide composition bias on the success of the parsimony criterion in phylogenetic inference

被引:58
作者
Conant, GC
Lewis, PO
机构
[1] Univ New Mexico, Dept Biol, Albuquerque, NM 87131 USA
[2] Univ Connecticut, Dept Ecol & Evolutionary Biol, Storrs, CT 06269 USA
关键词
nucleotide composition; phylogeny; LogDet; G plus C bias; maximum parsimony;
D O I
10.1093/oxfordjournals.molbev.a003874
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Convergence in nucleotide composition (CNC) in unrelated lineages is a factor potentially affecting the performance of most phylogeny reconstruction methods. Such convergence has deleterious effects because unrelated lineages show similarities due to similar nucleotide compositions and not shared histories. While some methods (such as the LogDet/paralinear distance measure) avoid this pitfall, the amount of convergence in nucleotide composition necessary to deceive other phylogenetic methods has never been quantified. We examined analytically the relationship between convergence in nucleotide composition and the consistency of parsimony as a phylogenetic estimator for four taxa. Our results show that rather extreme amounts of convergence are necessary before parsimony begins to prefer the incorrect tree. Ancillary observations are that(for unweighted Fitch parsimony) transition/transversion bias contributes to the impact of CNC and, for a given amount of CNC and fixed branch lengths, data sets exhibiting substantial site-to-site rate heterogeneity present fewer difficulties than data sets in which rates are homogeneous. We conclude by reexamining a data set originally used to illustrate the problems caused by CNC. Using simulations, we show that in this case the convergence in nucleotide composition alone is insufficient to cause any commonly used methods to fail, and accounting for other evolutionary factors (such as site-to-site rate heterogeneity) can give a correct inference without accounting for CNC.
引用
收藏
页码:1024 / 1033
页数:10
相关论文
共 31 条
[1]   A PHYLOGENETIC ANALYSIS OF AQUIFEX-PYROPHILUS [J].
BURGGRAF, S ;
OLSEN, GJ ;
STETTER, KO ;
WOESE, CR .
SYSTEMATIC AND APPLIED MICROBIOLOGY, 1992, 15 (03) :352-356
[2]   CASES IN WHICH PARSIMONY OR COMPATIBILITY METHODS WILL BE POSITIVELY MISLEADING [J].
FELSENSTEIN, J .
SYSTEMATIC ZOOLOGY, 1978, 27 (04) :401-410
[3]  
Felsenstein J., 1993, PHYLIP PHYLOGENY INF
[4]   Compositional bias may affect both DNA-based and protein-based phylogenetic reconstructions [J].
Foster, PG ;
Hickey, DA .
JOURNAL OF MOLECULAR EVOLUTION, 1999, 48 (03) :284-290
[5]   Inferring pattern and process: Maximum-likelihood implementation of a nonhomogeneous model of DNA sequence evolution for phylogenetic analysis [J].
Galtier, N ;
Gouy, M .
MOLECULAR BIOLOGY AND EVOLUTION, 1998, 15 (07) :871-879
[6]   INFERRING PHYLOGENIES FROM DNA-SEQUENCES OF UNEQUAL BASE COMPOSITIONS [J].
GALTIER, N ;
GOUY, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1995, 92 (24) :11317-11321
[7]   SUCCESS OF MAXIMUM-LIKELIHOOD PHYLOGENY INFERENCE IN THE 4-TAXON CASE [J].
GAUT, BS ;
LEWIS, PO .
MOLECULAR BIOLOGY AND EVOLUTION, 1995, 12 (01) :152-162
[8]  
GOLDMAN N, 1994, MOL BIOL EVOL, V11, P725
[9]   RIBOSOMAL-RNA TREES MISLEADING [J].
HASEGAWA, M ;
HASHIMOTO, T .
NATURE, 1993, 361 (6407) :23-23
[10]   PERFORMANCE OF PHYLOGENETIC METHODS IN SIMULATION [J].
HUELSENBECK, JP .
SYSTEMATIC BIOLOGY, 1995, 44 (01) :17-48