Models of coding sequence evolution

被引:56
作者
Delport, Wayne
Scheffler, Konrad [3 ]
Seoighe, Cathal [1 ,2 ]
机构
[1] Univ Cape Town, Inst Infect Dis & Mol Med, ZA-7925 Cape Town, South Africa
[2] Univ Western Cape, ZA-7535 Bellville, South Africa
[3] Univ Stellenbosch, ZA-7600 Stellenbosch, South Africa
关键词
AMINO-ACID SITES; DETECTING POSITIVE SELECTION; CODON-SUBSTITUTION MODELS; ADAPTIVE EVOLUTION; LIKELIHOOD METHOD; MAXIMUM-LIKELIHOOD; BAYESIAN-INFERENCE; NUCLEOTIDE SUBSTITUTION; PHYLOGENETIC ANALYSIS; MOLECULAR EVOLUTION;
D O I
10.1093/bib/bbn049
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Probabilistic models of sequence evolution are in widespread use in phylogenetics and molecular sequence evolution. These models have become increasingly sophisticated and combined with statistical model comparison techniques have helped to shed light on how genes and proteins evolve. Models of codon evolution have been particularly useful, because, in addition to providing a significant improvement in model realism for protein-coding sequences, codon models can also be designed to test hypotheses about the selective pressures that shape the evolution of the sequences. Such models typically assume a phylogeny and can be used to identify sites or lineages that have evolved adaptively. Recently some of the key assumptions that underlie phylogenetic tests of selection have been questioned, such as the assumption that the rate of synonymous changes is constant across sites or that a single phylogenetic tree can be assumed at all sites for recombining sequences. While some of these issues have been addressed through the development of novel methods, others remain as caveats that need to be considered on a case-by-case basis. Here, we outline the theory of codon models and their application to the detection of positive selection. We review some of the more recent developments that have improved their power and utility, laying a foundation for further advances in the modeling of coding sequence evolution.
引用
收藏
页码:97 / 109
页数:13
相关论文
共 102 条
[1]  
Akaike H., 1973, INT S INF THEOR AK K
[2]  
Anisimova M, 2003, GENETICS, V164, P1229
[3]   Accuracy and power of Bayes prediction of amino acid sites under positive selection [J].
Anisimova, M ;
Bielawski, JP ;
Yang, ZH .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (06) :950-958
[4]   Multiple hypothesis testing to detect lineages under positive selection that affects only a few sites [J].
Anisimova, Maria ;
Yang, Ziheng .
MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (05) :1219-1228
[5]   Likelihood-based clustering (LiBaC) for codon models, a method for grouping sites according to similarities in the underlying process of evolution [J].
Bao, Le ;
Gu, Hong ;
Dunn, Katherine A. ;
Bielawski, Joseph P. .
MOLECULAR BIOLOGY AND EVOLUTION, 2008, 25 (09) :1995-2007
[6]   Methods for selecting fixed-effect models for heterogeneous codon evolution, with comments on their application to gene and genome data [J].
Bao, Le ;
Gu, Hong ;
Dunn, Katherine A. ;
Bielawski, Joseph P. .
BMC EVOLUTIONARY BIOLOGY, 2007, 7 (Suppl 1)
[7]   Positive selection at sites of multiple amino acid replacements since rat-mouse divergence [J].
Bazykin, GA ;
Kondrashov, FA ;
Ogurtsov, AY ;
Sunyaev, S ;
Kondrashov, AS .
NATURE, 2004, 429 (6991) :558-562
[8]   Bursts of nonsynonymous substitutions in HIV-1 evolution reveal instances of positive selection at conservative protein sites [J].
Bazykin, Georgii A. ;
Dushoff, Jonathan ;
Levin, Simon A. ;
Kondrashov, Alexey S. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (51) :19396-19401
[9]   A site- and time-heterogeneous model of amino acid replacement [J].
Blanquart, Samuel ;
Lartillot, Nicolas .
MOLECULAR BIOLOGY AND EVOLUTION, 2008, 25 (05) :842-858
[10]   A Bayesian compound stochastic process for modeling nonstationary and nonhomogeneous sequence evolution [J].
Blanquart, Samuel ;
Lartillot, Nicolas .
MOLECULAR BIOLOGY AND EVOLUTION, 2006, 23 (11) :2058-2071