Phylogenetic inference using whole Genomes

被引:130
作者
Rannala, Bruce [1 ,2 ]
Yang, Ziheng [3 ,4 ]
机构
[1] Univ Calif Davis, Genome Ctr, Davis, CA 95616 USA
[2] Univ Calif Davis, Dept Ecol & Evolut, Davis, CA 95616 USA
[3] UCL, Dept Biol, London WC1E 6BT, England
[4] Univ Tokyo, Lab Biometr, Grad Sch Agr & Life Sci, Tokyo, Japan
基金
英国自然环境研究理事会;
关键词
genomic sequences; coalescent process; gene tree;
D O I
10.1146/annurev.genom.9.081307.164407
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The availability of genome-wide data provides unprecedented opportunities for resolving difficult phylogenetic relationships and for studying population genetic processes of mutation, selection, and recombination on a genomic scale. The use of appropriate statistical models becomes increasingly important when we are faced with very large datasets, which call lead to improved precision but not necessarily improved accuracy if the analytical methods have systematic biases. This review provides a critical examination of methods for analyzing genomic datasets from multiple loci, including concatenation, separate gene-by-gene analyses, and statistical models that accommodate heterogeneity in different aspects of the evolutionary process among data partitions. We discuss factors that may cause the gene tree to differ from the species tree, as well as strategies for estimating species phylogenies in the presence of gene tree conflicts. Genomic datasets provide computational and statistical challenges that are likely to be a focus of research for years to collie.
引用
收藏
页码:217 / 231
页数:15
相关论文
共 70 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
Ané C, 2007, MOL BIOL EVOL, V24, P412
[3]  
Bininda-Emonds O., 2004, PHYLOGENETIC SUPERTR
[4]   Supertree construction in the genomic age [J].
Bininda-Emonds, ORP .
MOLECULAR EVOLUTION: PRODUCING THE BIOCHEMICAL DATA, PART B, 2005, 395 :745-757
[5]   Variation in evolutionary processes at different codon positions [J].
Bofkin, Lee ;
Goldman, Nick .
MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (02) :513-521
[6]   Exploring among-site rate variation models in a maximum likelihood framework using empirical data: Effects of model assumptions on estimates of topology, branch lengths, and bootstrap support [J].
Buckley, TR ;
Simon, C ;
Chambers, GK .
SYSTEMATIC BIOLOGY, 2001, 50 (01) :67-86
[7]   Supertree bootstrapping methods for assessing phylogenetic variation among genes in genome-scale data sets [J].
Burleigh, J. Gordon ;
Driskell, Amy C. ;
Sanderson, Michael J. .
SYSTEMATIC BIOLOGY, 2006, 55 (03) :426-440
[8]   Genomic divergences between humans and other hominoids and the effective population size of the common ancestor of humans and chimpanzees [J].
Chen, FC ;
Li, WH .
AMERICAN JOURNAL OF HUMAN GENETICS, 2001, 68 (02) :444-456
[9]   The supermatrix approach to systematics [J].
de Queiroz, Alan ;
Gatesy, John .
TRENDS IN ECOLOGY & EVOLUTION, 2007, 22 (01) :34-41
[10]   Discordance of species trees with their most likely gene trees [J].
Degnan, James H. ;
Rosenberg, Noah A. .
PLOS GENETICS, 2006, 2 (05) :762-768