Retrieving a mitogenomic mammal tree using composite taxa

被引:22
作者
Campbell, Veronique [1 ]
Lapointe, Francois-Joseph [1 ]
机构
[1] Univ Montreal, Dept Sci Biol, Montreal, PQ H3C 3J7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Composite sequences; Computer simulations; Congruence; DNA sequences; Missing data; Phylogenetic accuracy; Phylogenomics; Supermatrices; MAXIMUM-LIKELIHOOD-ESTIMATION; MISSING DATA; EVOLUTIONARY TREES; DNA-SEQUENCES; MITOCHONDRIAL GENOMES; PHYLOGENETIC ANALYSES; MOLECULAR PHYLOGENY; ANIMAL PHYLOGENY; INCOMPLETE TAXA; NUCLEAR GENES;
D O I
10.1016/j.ympev.2010.11.017
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Supermatrices are often characterized by a large amount of missing data. One possible approach to minimize such missing data is to create composite taxa. These taxa are formed by sampling sequences from different species in order to obtain a composite sequence that includes a maximum number of genes. Although this approach is increasingly used, its accuracy has rarely been tested and some authors prefer to analyze incomplete supermatrices by coding unavailable sequences as missing. To further validate the composite taxon approach, it was applied to complete mitochondrial matrices of 102 mammal species representing 93 families with varying amount of missing data. On average, missing data and composite matrices showed similar congruence to model trees obtained from the complete sequence matrix. As expected, the level of congruence to model trees decreased as missing data increased, with both approaches. We conclude that the composite taxon approach is worth considering in a phylogenomic context since it performs well and reduces computing time when compared to missing data matrices. (C) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:149 / 156
页数:8
相关论文
共 97 条
[1]   Mitogenomic relationships of placental mammals and molecular estimates of their divergences [J].
Arnason, Ulfur ;
Adegoke, Joseph A. ;
Gullberg, Anette ;
Harley, Eric H. ;
Janke, Axel ;
Kullberg, Morgan .
GENE, 2008, 421 (1-2) :37-51
[2]   First combined cladistic analysis of marsupial mammal interrelationships [J].
Asher, RJ ;
Horovitz, I ;
Sánchez-Villagra, MR .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2004, 33 (01) :240-250
[3]   A dated phylogeny of marsupials using a molecular supermatrix and multiple fossil constraints [J].
Beck, Robin M. D. .
JOURNAL OF MAMMALOGY, 2008, 89 (01) :175-189
[4]   A review of long-branch attraction [J].
Bergsten, J .
CLADISTICS, 2005, 21 (02) :163-193
[5]   Assessment of the accuracy of matrix representation with parsimony analysis supertree construction [J].
Bininda-Emonds, ORP ;
Sanderson, MJ .
SYSTEMATIC BIOLOGY, 2001, 50 (04) :565-579
[6]   Testing the new animal phylogeny: A phylum level molecular analysis of the animal kingdom [J].
Bourlat, Sarah J. ;
Nielsen, Claus ;
Economou, Andrew D. ;
Telford, Maximilian J. .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2008, 49 (01) :23-31
[7]  
CAMPBELL V, DNA SIMULATION UNPUB
[8]   The Use and Validity of Composite Taxa in Phylogenetic Analysis [J].
Campbell, Veronique ;
Lapointe, Francois-Joseph .
SYSTEMATIC BIOLOGY, 2009, 58 (06) :560-572
[9]   Genomoc databases and the tree of life [J].
Crandall, KA ;
Buhay, JE .
SCIENCE, 2004, 306 (5699) :1144-1145
[10]   Tunicates and not cephalochordates are the closest living relatives of vertebrates [J].
Delsuc, F ;
Brinkmann, H ;
Chourrout, D ;
Philippe, H .
NATURE, 2006, 439 (7079) :965-968