Retrieving a mitogenomic mammal tree using composite taxa

被引:22
作者
Campbell, Veronique [1 ]
Lapointe, Francois-Joseph [1 ]
机构
[1] Univ Montreal, Dept Sci Biol, Montreal, PQ H3C 3J7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Composite sequences; Computer simulations; Congruence; DNA sequences; Missing data; Phylogenetic accuracy; Phylogenomics; Supermatrices; MAXIMUM-LIKELIHOOD-ESTIMATION; MISSING DATA; EVOLUTIONARY TREES; DNA-SEQUENCES; MITOCHONDRIAL GENOMES; PHYLOGENETIC ANALYSES; MOLECULAR PHYLOGENY; ANIMAL PHYLOGENY; INCOMPLETE TAXA; NUCLEAR GENES;
D O I
10.1016/j.ympev.2010.11.017
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Supermatrices are often characterized by a large amount of missing data. One possible approach to minimize such missing data is to create composite taxa. These taxa are formed by sampling sequences from different species in order to obtain a composite sequence that includes a maximum number of genes. Although this approach is increasingly used, its accuracy has rarely been tested and some authors prefer to analyze incomplete supermatrices by coding unavailable sequences as missing. To further validate the composite taxon approach, it was applied to complete mitochondrial matrices of 102 mammal species representing 93 families with varying amount of missing data. On average, missing data and composite matrices showed similar congruence to model trees obtained from the complete sequence matrix. As expected, the level of congruence to model trees decreased as missing data increased, with both approaches. We conclude that the composite taxon approach is worth considering in a phylogenomic context since it performs well and reduces computing time when compared to missing data matrices. (C) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:149 / 156
页数:8
相关论文
共 97 条
[31]   CLUSTAL - A PACKAGE FOR PERFORMING MULTIPLE SEQUENCE ALIGNMENT ON A MICROCOMPUTER [J].
HIGGINS, DG ;
SHARP, PM .
GENE, 1988, 73 (01) :237-244
[32]   APPROACHES FOR ASSESSING PHYLOGENETIC ACCURACY [J].
HILLIS, DM .
SYSTEMATIC BIOLOGY, 1995, 44 (01) :3-16
[33]   Is sparse taxon sampling a problem for phylogenetic inference? [J].
Hillis, DM ;
Pollock, DD ;
McGuire, JA ;
Zwickl, DJ .
SYSTEMATIC BIOLOGY, 2003, 52 (01) :124-126
[34]   Phylogenetic analyses of complete mitochondrial genome sequences suggest a basal divergence of the enigmatic rodent Anomalurus [J].
Horner, David S. ;
Lefkimmiatis, Konstantinos ;
Reyes, Aurelio ;
Gissi, Carmela ;
Saccone, Cecilia ;
Pesole, Graziano .
BMC EVOLUTIONARY BIOLOGY, 2007, 7 (1)
[35]   WHEN ARE FOSSILS BETTER THAN EXTANT TAXA IN PHYLOGENETIC ANALYSIS [J].
HUELSENBECK, JP .
SYSTEMATIC ZOOLOGY, 1991, 40 (04) :458-469
[36]   PERFORMANCE OF PHYLOGENETIC METHODS IN SIMULATION [J].
HUELSENBECK, JP .
SYSTEMATIC BIOLOGY, 1995, 44 (01) :17-48
[37]   MRBAYES: Bayesian inference of phylogenetic trees [J].
Huelsenbeck, JP ;
Ronquist, F .
BIOINFORMATICS, 2001, 17 (08) :754-755
[38]   Rates of genome evolution and branching order from whole genome analysis [J].
Huttley, Gavin A. ;
Wakefield, Matthew J. ;
Easteal, Simon .
MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (08) :1722-1730
[39]  
Ihaka R., 1996, J. Comput. Graph. Stat., V5, P299, DOI [10.2307/1390807, 10.1080/10618600.1996.10474713, DOI 10.1080/10618600.1996.10474713]
[40]   Phylogenomics: the beginning of incongruence? [J].
Jeffroy, O ;
Brinkmann, H ;
Delsuc, F ;
Philippe, H .
TRENDS IN GENETICS, 2006, 22 (04) :225-231