Detecting and overcoming systematic errors in genome-scale phylogenies

被引:226
作者
Rodriguez-Ezpeleta, Naiara
Brinkmann, Henner
Roure, Beatrice
Lartillot, Nicolas
Lang, B. Franz
Philippe, Herve
机构
[1] Univ Montreal, Dept Biochim, Canadian Inst Adv Res, Ctr Robert Cedergren, Montreal, PQ H3T 1J4, Canada
[2] Univ Montpellier 2, CNRS, UMR 5506, Lab Informat Robot & Microelect Montpellier, F-34392 Montpellier 5, France
基金
加拿大健康研究院; 加拿大创新基金会;
关键词
compositional heterogeneity; data removal; eukaryotic phylogeny; inconsistency; long-branch attraction; nonphylogenetic signal; phylogenomics; systematic error;
D O I
10.1080/10635150701397643
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genome-scale data sets result in an enhanced resolution of the phylogenetic inference by reducing stochastic errors. However, there is also an increase of systematic errors due to model violations, which can lead to erroneous phylogenies. Here, we explore the impact of systematic errors on the resolution of the eukaryotic phylogeny using a data set of 143 nuclear-encoded proteins from 37 species. The initial observation was that, despite the impressive amount of data, some branches had no significant statistical support. To demonstrate that this lack of resolution is due to a mutual annihilation of phylogenetic and nonphylogenetic signals, we created a series of data sets with slightly different taxon sampling. As expected, these data sets yielded strongly supported but mutually exclusive trees, thus confirming the presence of conflicting phylogenetic and nonphylogenetic signals in the original data set. To decide on the correct tree, we applied several methods expected to reduce the impact of some kinds of systematic error. Briefly, we show that (i) removing fast-evolving positions, (ii) recoding amino acids into functional categories, and (iii) using a site-heterogeneous mixture model (CAT) are three effective means of increasing the ratio of phylogenetic to nonphylogenetic signal. Finally, our results allow us to formulate guidelines for detecting and overcoming phylogenetic artefacts in genome-scale phylogenetic analyses.
引用
收藏
页码:389 / 399
页数:11
相关论文
共 78 条
[1]   Matched-pairs tests of homogeneity with applications to homologous nucleotide sequences [J].
Ababneh, F ;
Jermiin, LS ;
Ma, CS ;
Robinson, J .
BIOINFORMATICS, 2006, 22 (10) :1225-1231
[2]  
ADACHI J, 1996, COMPUT SCI MONOGR, V28, P1
[3]   Evidence for a clade of nematodes, arthropods and other moulting animals [J].
Aguinaldo, AMA ;
Turbeville, JM ;
Linford, LS ;
Rivera, MC ;
Garey, JR ;
Raff, RA ;
Lake, JA .
NATURE, 1997, 387 (6632) :489-493
[4]   A kingdom-level phylogeny of eukaryotes based on combined protein data [J].
Baldauf, SL ;
Roger, AJ ;
Wenk-Siefert, I ;
Doolittle, WF .
SCIENCE, 2000, 290 (5493) :972-977
[5]   The analysis of 100 genes supports the grouping of three highly divergent amoebae:: Dictyostelium, Entamoeba, and Mastigamoeba [J].
Bapteste, E ;
Brinkmann, H ;
Lee, JA ;
Moore, DV ;
Sensen, CW ;
Gordon, P ;
Duruflé, L ;
Gaasterland, T ;
Lopez, P ;
Müller, M ;
Philippe, H .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (03) :1414-1419
[6]   Lack of resolution in the animal phylogeny: Closely spaced cladogeneses or undetected systematic errors? [J].
Baurain, Denis ;
Brinkmann, Henner ;
Philippe, Herve .
MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (01) :6-9
[7]   A Bayesian compound stochastic process for modeling nonstationary and nonhomogeneous sequence evolution [J].
Blanquart, Samuel ;
Lartillot, Nicolas .
MOLECULAR BIOLOGY AND EVOLUTION, 2006, 23 (11) :2058-2071
[8]   A TEST FOR SYMMETRY IN CONTINGENCY TABLES [J].
BOWKER, AH .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1948, 43 (244) :572-574
[9]   An empirical assessment of long-branch attraction artefacts in deep eukaryotic phylogenomics [J].
Brinkmann, H ;
Van der Giezen, M ;
Zhou, Y ;
De Raucourt, GP ;
Philippe, H .
SYSTEMATIC BIOLOGY, 2005, 54 (05) :743-757
[10]   Archaea sister group of bacteria? Indications from tree reconstruction artifacts in ancient phylogenies [J].
Brinkmann, H ;
Philippe, H .
MOLECULAR BIOLOGY AND EVOLUTION, 1999, 16 (06) :817-825