Animal phylogeny and large-scale sequencing: progress and pitfalls

Cited by: 40
Authors
Brinkmann, Henner [1]
Philippe, Hervé [1]
Affiliation
[1] Univ Montreal, Dept Biochim, Ctr Robert Cedergren, Montreal, PQ H3T 1J4, Canada
Keywords
long-branch attraction (LBA) artifact; new animal phylogeny; phylogenomics; random error; systematic error
DOI
10.3724/SP.J.1002.2008.08038
Chinese Library Classification number
Q94 [Botany]
Subject classification code
071001
Abstract
Phylogenomics, the inference of phylogenetic trees using genome-scale data, is becoming the rule for resolving difficult parts of the tree of life. Its promise resides in the large amount of information available, which should eliminate stochastic error. However, systematic error, which is due to limitations of reconstruction methods, is becoming more apparent. Using animal phylogeny as a case study, we illustrate the three most efficient approaches to avoid the pitfalls of phylogenomics: (1) using dense taxon sampling, (2) using probabilistic methods with complex models of sequence evolution that more accurately detect multiple substitutions, and (3) removing the fastest-evolving part of the data (e.g., species and positions). The analysis of a dataset of 55 animal species and 102 proteins (25,712 amino acid positions) shows that inference under a standard site-homogeneous model is sensitive to the long-branch attraction (LBA) artifact, whereas the site-heterogeneous CAT model is less so. The latter model correctly locates three very fast-evolving species: the appendicularian tunicate Oikopleura, the acoel Convoluta and the myxozoan Buddenbrockia. Overall, the resulting tree is in excellent agreement with the new animal phylogeny, confirming that "simple" organisms like platyhelminths and nematodes are not necessarily of basal emergence. This further emphasizes the importance of secondary simplification in animals, and in organismal evolution in general.
Pages: 274-286
Number of pages: 13