Taking variation of evolutionary rates between sites into account in inferring phylogenies

被引:73
作者
Felsenstein, J [1 ]
机构
[1] Univ Washington, Dept Genet, Seattle, WA 98195 USA
关键词
phylogeny; evolutionary rate; maximum likelihood; distance; parsimony; invariants;
D O I
10.1007/s002390010234
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
As methods of molecular phylogeny have become more explicit and more biologically realistic following the pioneering work of Thomas Jukes, they have had to relax their initial assumption that rates of evolution were equal at all sites. Distance matrix and likelihood methods of inferring phylogenies make this assumption; parsimony, when valid, is less limited by it. Nucleotide sequences, including RNA sequences, can show substantial rate variation; protein sequences show rates that vary much more widely. Assuming a prior distribution of rates such as a gamma distribution or lognormal distribution has deservedly been popular, but for likelihood methods it leads to computational difficulties. These can be resolved using hidden Markov model (HMM) methods which approximate the distribution by one with a modest number of discrete rates. Generalized Laguerre quadrature can be used to improve the selection of rates and their probabilities so as to more nearly approach the desired gamma distribution. A model based on population genetics is presented predicting how the rates of evolution might vary from locus to locus. Challenges for the future include allowing rates at a given site to vary along the tree, as in the "covarion" model, and allowing them to have correlations that reflect three-dimensional structure, rather than position in the coding sequence. Markov chain Monte Carlo likelihood methods may be the only practical way to carry out computations for these models.
引用
收藏
页码:447 / 455
页数:9
相关论文
共 33 条
[1]  
[Anonymous], 1971, STAT DECISION THEORY
[2]  
[Anonymous], 1964, Handbook of mathematical functions
[3]   Molecular evolution modeled as a fractal Poisson process in agreement with mammalian sequence comparisons [J].
Bickel, DR ;
West, BJ .
MOLECULAR BIOLOGY AND EVOLUTION, 1998, 15 (08) :967-977
[4]   INVARIANTS OF PHYLOGENIES IN A SIMPLE CASE WITH DISCRETE STATES [J].
CAVENDER, JA ;
FELSENSTEIN, J .
JOURNAL OF CLASSIFICATION, 1987, 4 (01) :57-71
[5]   Estimating divergence times in the presence of an overdispersed molecular clock [J].
Cutler, DJ .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (11) :1647-1660
[6]   A SUCCESSIVE APPROXIMATIONS APPROACH TO CHARACTER WEIGHTING [J].
FARRIS, JS .
SYSTEMATIC ZOOLOGY, 1969, 18 (04) :374-&
[7]   A hidden Markov Model approach to variation among sites in rate of evolution [J].
Felsenstein, J ;
Churchill, GA .
MOLECULAR BIOLOGY AND EVOLUTION, 1996, 13 (01) :93-104
[8]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[10]   Self-weighted optimization: Tree searches and character state reconstructions under implied transformation costs [J].
Goloboff, PA .
CLADISTICS, 1997, 13 (03) :225-245