An empirical assessment of long-branch attraction artefacts in deep eukaryotic phylogenomics

被引:189
作者
Brinkmann, H
Van der Giezen, M
Zhou, Y
De Raucourt, GP
Philippe, H
机构
[1] Univ Montreal, Dept Biochim, Ctr Robert Cedergren, Canadian Inst Adv Res, Montreal, PQ H3C 3J7, Canada
[2] Queen Mary Univ London, Sch Biol & Chem Sci, London E1 4NS, England
关键词
distant outgroup; eukaryotic tree; long-branch attraction; microsporidia; multigene data sets; nucleomorph; rooting; species sampling; systematic biases;
D O I
10.1080/10635150500234609
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In the context of exponential growing molecular databases, it becomes increasingly easy to assemble large multi-gene data sets for phylogenomic studies. The expected increase of resolution due to the reduction of the sampling (stochastic) error is becoming a reality. However, the impact of systematic biases will also become more apparent or even dominant. We have chosen to study the case of the long-branch attraction artefact (LBA) using real instead of simulated sequences. Two fast-evolving eukaryotic lineages, whose evolutionary positions are well established, microsporidia and the nucleomorph of cryptophytes, were chosen as model species. A large data set was assembled (44 species, 133 genes, and 24,294 amino acid positions) and the resulting rooted eukaryotic phylogeny (using a distant archaeal outgroup) is positively misled by an LBA artefact despite the use of a maximum likelihood-based tree reconstruction method with a complex model of sequence evolution. When the fastest evolving proteins from the fast lineages are progressively removed ( up to 90%), the bootstrap support for the apparently artefactual basal placement decreases to virtually 0%, and conversely only the expected placement, among all the possible locations of the fast-evolving species, receives increasing support that eventually converges to 100%. The percentage of removal of the fastest evolving proteins constitutes a reliable estimate of the sensitivity of phylogenetic inference to LBA. This protocol confirms that both a rich species sampling ( especially the presence of a species that is closely related to the fast-evolving lineage) and a probabilistic method with a complex model are important to overcome the LBA artefact. Finally, we observed that phylogenetic inference methods perform strikingly better with simulated as opposed to real data, and suggest that testing the reliability of phylogenetic inference methods with simulated data leads to overconfidence in their performance. Although phylogenomic studies can be affected by systematic biases, the possibility of discarding a large amount of data containing most of the nonphylogenetic signal allows recovering a phylogeny that is less affected by systematic biases, while maintaining a high statistical support.
引用
收藏
页码:743 / 757
页数:15
相关论文
共 111 条
[11]   The power of relative rates tests depends on the data [J].
Bromham, L ;
Penny, D ;
Rambaut, A ;
Hendy, MD .
JOURNAL OF MOLECULAR EVOLUTION, 2000, 50 (03) :296-301
[12]   Evolution of monoblepharidalean fungi based on complete mitochondrial genome sequences [J].
Bullerwell, CE ;
Forget, L ;
Lang, BF .
NUCLEIC ACIDS RESEARCH, 2003, 31 (06) :1614-1623
[13]   Phylogenetic signal in nucleotide data from seed plants: Implications for resolving the seed plant tree of life [J].
Burleigh, JG ;
Mathews, S .
AMERICAN JOURNAL OF BOTANY, 2004, 91 (10) :1599-1613
[14]  
BURNHAM K.P., 2002, MODEL SELECTION MULT, P352, DOI DOI 10.1007/B97636
[15]   Systematics of primary osmotrophic euglenids:: a molecular approach to the phylogeny of Distigma and Astasia (Euglenozoa) [J].
Busse, I ;
Preisfeld, A .
INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, 2003, 53 :617-624
[16]   Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis [J].
Castresana, J .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (04) :540-552
[17]   Membrane heredity and early chloroplast evolution [J].
Cavalier-Smith, T .
TRENDS IN PLANT SCIENCE, 2000, 5 (04) :174-182
[18]   Oxymonads are closely related to the excavate taxon Trimastix [J].
Dacks, JB ;
Silberman, JD ;
Simpson, AGB ;
Moriya, S ;
Kudo, T ;
Ohkuma, M ;
Redfield, RJ .
MOLECULAR BIOLOGY AND EVOLUTION, 2001, 18 (06) :1034-1044
[19]   Analyses of RNA polymerase II genes from free-living protists: Phylogeny, long branch attraction, and the eukaryotic big bang [J].
Dacks, JB ;
Marinets, A ;
Doolittle, WF ;
Cavalier-Smith, T ;
Logsdon, JM .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (06) :830-840
[20]   Phylogenomics and the reconstruction of the tree of life [J].
Delsuc, F ;
Brinkmann, H ;
Philippe, H .
NATURE REVIEWS GENETICS, 2005, 6 (05) :361-375