Species trees from gene trees: Reconstructing Bayesian posterior distributions of a species phylogeny using estimated gene tree distributions

Cited by: 367
Authors
Liu, Liang [1]
Pearl, Dennis K. [1]
Affiliations
[1] Ohio State Univ, Dept Stat, Columbus, OH 43210 USA
Funding
National Science Foundation (USA)
DOI
10.1080/10635150701429982
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
The desire to infer the evolutionary history of a group of species should be more viable now that a considerable amount of multilocus molecular data is available. However, the current molecular phylogenetic paradigm still reconstructs gene trees to represent the species tree. Further, commonly used methods of combining data, such as the concatenation method, are known to be inconsistent in some circumstances. In this paper, we propose a Bayesian hierarchical model to estimate the phylogeny of a group of species using multiple estimated gene tree distributions, such as those that arise in a Bayesian analysis of DNA sequence data. Our model employs substitution models used in traditional phylogenetics but also uses coalescent theory to explain genealogical signals from species trees to gene trees and from gene trees to sequence data, thereby forming a complete stochastic model to estimate gene trees, species trees, ancestral population sizes, and species divergence times simultaneously. Our model is founded on the assumption that gene trees, even of unlinked loci, are correlated due to being derived from a single species tree and therefore should be estimated jointly. We apply the method to two multilocus data sets of DNA sequences. The estimates of the species tree topology and divergence times appear to be robust to the prior of the population size, whereas the estimates of effective population sizes are sensitive to the prior used in the analysis. These analyses also suggest that the model is superior to the concatenation method in fitting these data sets and thus provides a more realistic assessment of the variability in the distribution of the species tree that may have produced the molecular information at hand. Future improvements of our model and algorithm should include consideration of other factors that can cause discordance of gene trees and species trees, such as horizontal transfer or gene duplication.
Pages: 504-514
Page count: 11