Assessment of phylogenomic and orthology approaches for phylogenetic inference

被引:51
作者
Dutilh, B. E.
van Noort, V.
van der Heijden, R. T. J. M.
Boekhout, T.
Snel, B.
Huynen, M. A.
机构
[1] Radboud Univ Nijmegen, Nijmegen Med Ctr, Nijmegen Ctr Mol Life Sci, Ctr Mol & Biomol Informat, NL-6500 HB Nijmegen, Netherlands
[2] Cent Bur Schimmelcultures, NL-3584 CT Utrecht, Netherlands
[3] Univ Utrecht, Dept Biol, Bioinformat Grp, NL-3584 CH Utrecht, Netherlands
关键词
D O I
10.1093/bioinformatics/btm015
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Phylogenomics integrates the vast amount of phylogenetic information contained in complete genome sequences, and is rapidly becoming the standard for reliably inferring species phylogenies. There are, however, fundamental differences between the ways in which phylogenomic approaches like gene content, superalignment, superdistance and supertree integrate the phylogenetic information from separate orthologous groups. Furthermore, they all depend on the method by which the orthologous groups are initially determined. Here, we systematically compare these four phylogenomic approaches, in parallel with three approaches for large-scale orthology determination: pairwise orthology, cluster orthology and tree-based orthology. Results: Including various phylogenetic methods, we apply a total of 54 fully automated phylogenomic procedures to the fungi, the eukaryotic clade with the largest number of sequenced genomes, for which we retrieved a golden standard phylogeny from the literature. Phylogenomic trees based on gene content show, relative to the other methods, a bias in the tree topology that parallels convergence in lifestyle among the species compared, indicating convergence in gene content. Conclusions: Complete genomes are no guarantee for good or even consistent phylogenies. However, the large amounts of data in genomes enable us to carefully select the data most suitable for phylogenomic inference. In terms of performance, the superalignment approach, combined with restrictive orthology, is the most successful in recovering a fungal phylogeny that agrees with current taxonomic views, and allows us to obtain a high-resolution phylogeny. We provide solid support for what has grown to be a common practice in phylogenomics during its advance in recent years.
引用
收藏
页码:815 / 824
页数:10
相关论文
共 68 条
[1]   Lateral gene transfer in eukaryotes [J].
Andersson, JO .
CELLULAR AND MOLECULAR LIFE SCIENCES, 2005, 62 (11) :1182-1197
[2]  
[Anonymous], 1989, Cladistics, DOI DOI 10.1111/J.1096-0031.1989.TB00562.X
[3]   Ribosomal DNA and resolution of branching order among the ascomycota: How many nucleotides are enough? [J].
Berbee, ML ;
Carmean, DA ;
Winka, K .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2000, 17 (03) :337-344
[4]   The evolution of supertrees [J].
Bininda-Emonds, ORP .
TRENDS IN ECOLOGY & EVOLUTION, 2004, 19 (06) :315-322
[5]   Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis [J].
Castresana, J .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (04) :540-552
[6]   Toward automatic reconstruction of a highly resolved tree of life [J].
Ciccarelli, FD ;
Doerks, T ;
von Mering, C ;
Creevey, CJ ;
Snel, B ;
Bork, P .
SCIENCE, 2006, 311 (5765) :1283-1287
[7]   Finding functional features in Saccharomyces genomes by phylogenetic footprinting [J].
Cliften, P ;
Sudarsanam, P ;
Desikan, A ;
Fulton, L ;
Fulton, B ;
Majors, J ;
Waterston, R ;
Cohen, BA ;
Johnston, M .
SCIENCE, 2003, 301 (5629) :71-76
[8]   Clann: investigating phylogenetic information through supertree analyses [J].
Creevey, CJ ;
McInerney, JO .
BIOINFORMATICS, 2005, 21 (03) :390-392
[9]  
CRISCUOLO A, 2006, IN PRESS SYST BIOL
[10]   The tree of one percent [J].
Dagan, Tal ;
Martin, William .
GENOME BIOLOGY, 2006, 7 (10)