Investigating Protein-Coding Sequence Evolution with Probabilistic Codon Substitution Models

被引:107
作者
Anisimova, Maria [1 ,2 ]
Kosiol, Carolin [3 ]
机构
[1] Swiss Fed Inst Technol ETHZ, Inst Computat Sci, Zurich, Switzerland
[2] Swiss Inst Bioinformat, Lausanne, Switzerland
[3] Cornell Univ, Dept Biol Stat & Computat Biol, Ithaca, NY 14853 USA
基金
美国国家科学基金会;
关键词
AMINO-ACID SITES; DETECTING POSITIVE SELECTION; MAXIMUM-LIKELIHOOD-ESTIMATION; NONSYNONYMOUS NUCLEOTIDE SUBSTITUTION; DNA-SEQUENCES; MOLECULAR EVOLUTION; ADAPTIVE EVOLUTION; GENETIC ALGORITHM; BAYESIAN-INFERENCE; NATURAL-SELECTION;
D O I
10.1093/molbev/msn232
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
This review is motivated by the true explosion in the number of recent studies both developing and ameliorating probabilistic models of codon evolution. Traditionally parametric, the first codon models focused on estimating the effects of selective pressure on the protein via an explicit parameter in the maximum likelihood framework. Likelihood ratio tests of nested codon models armed the biologists with powerful tools, which provided unambiguous evidence for positive selection in real data. This, in turn, triggered a new wave of methodological developments. The new generation of models views the codon evolution process in a more sophisticated way, relaxing several mathematical assumptions. These models make a greater use of physicochemical amino acid properties, genetic code machinery, and the large amounts of data from the public domain. The overview of the most recent advances on modeling codon evolution is presented here, and a wide range of their applications to real data is discussed. On the downside, availability of a large variety of models, each accounting for various biological factors, increases the margin for misinterpretation; the biological meaning of certain parameters may vary among models, and model selection procedures also deserve greater attention. Solid understanding of the modeling assumptions and their applicability is essential for successful statistical data analysis.
引用
收藏
页码:255 / 271
页数:17
相关论文
共 185 条
  • [21] AMINO-ACID SUBSTITUTION DURING FUNCTIONALLY CONSTRAINED DIVERGENT EVOLUTION OF PROTEIN SEQUENCES
    BENNER, SA
    COHEN, MA
    GONNET, GH
    [J]. PROTEIN ENGINEERING, 1994, 7 (11): : 1323 - 1332
  • [22] Bielawski Joseph P., 2003, Journal of Structural and Functional Genomics, V3, P201, DOI 10.1023/A:1022642807731
  • [23] A maximum likelihood method for detecting functional divergence at individual codon sites, with application to gene family evolution
    Bielawski, JP
    Yang, ZH
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 2004, 59 (01) : 121 - 132
  • [24] A site- and time-heterogeneous model of amino acid replacement
    Blanquart, Samuel
    Lartillot, Nicolas
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2008, 25 (05) : 842 - 858
  • [25] Variation in evolutionary processes at different codon positions
    Bofkin, Lee
    Goldman, Nick
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (02) : 513 - 521
  • [26] SIMMAP: Stochastic character mapping of discrete traits on phylogenies
    Bollback, JP
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [27] Transducers: an emerging probabilistic framework for modeling indels on trees
    Bradley, Robert K.
    Holmes, Ian
    [J]. BIOINFORMATICS, 2007, 23 (23) : 3258 - 3262
  • [28] Disease progression and evolution of the HIV-1 env gene in 24 infected infants
    Carvajal-Rodriguez, Antonio
    Posada, David
    Perez-Losada, Marcos
    Keller, Emily
    Abrams, Elaine J.
    Viscidi, Raphael P.
    Crandall, Keith A.
    [J]. INFECTION GENETICS AND EVOLUTION, 2008, 8 (02) : 110 - 120
  • [29] Hearing silence: non-neutral evolution at synonymous sites in mammals
    Chamary, JV
    Parmley, JL
    Hurst, LD
    [J]. NATURE REVIEWS GENETICS, 2006, 7 (02) : 98 - 108
  • [30] Ancestral gene reconstruction and synthesis of ancient rhodopsins in the laboratory
    Chang, BSW
    [J]. INTEGRATIVE AND COMPARATIVE BIOLOGY, 2003, 43 (04) : 500 - 507