A genetic algorithm approach to detecting lineage-specific variation in selection pressure

被引:153
作者
Pond, SLK [1 ]
Frost, SDW [1 ]
机构
[1] Univ Calif San Diego, Antiviral Res Ctr, San Diego, CA USA
关键词
lineage-specific selection; model selection; genetic algorithms; branch site model; codon substitution model;
D O I
10.1093/molbev/msi031
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The ratio of nonsynonymous (dN) to synonymous (dS) substitution rates, omega, provides a measure of selection at the protein level. Models have been developed that allow omega to vary among lineages. However, these models require the lineages in which differential selection has acted to be specified a priori. We propose a genetic algorithm approach to assign lineages in a phylogeny to a fixed number of different classes of to, thus allowing variable selection pressure without a priori specification of particular lineages. This approach can identify models with a better fit than a single-ratio model, and with fits that are better than (in an information theoretic sense) a fully local model, in which all lineages are assumed to evolve under different values of w, but with far fewer parameters. By averaging over models which explain the data reasonably well, we can assess the robustness of our conclusions to uncertainty in model estimation. Our approach can also be used to compare results from models in which branch classes are specified a priori with a wide range of credible models. We illustrate our methods on primate lysozyme sequences and compare them with previous methods applied to the same data sets.
引用
收藏
页码:478 / 485
页数:8
相关论文
共 23 条
  • [1] Abramowitz M., 1972, HDB MATH FUNCTIONS F
  • [2] NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION
    AKAIKE, H
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) : 716 - 723
  • [3] AKAIKE H, 1983, INT STAT I, V44, P139
  • [4] Genetic algorithms and parallel processing in maximum-likelihood phylogeny inference
    Brauer, MJ
    Holder, MT
    Dries, LA
    Zwickl, DJ
    Lewis, PO
    Hillis, DM
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (10) : 1717 - 1726
  • [5] Burnham KP., 2007, Model selection and multimodel inference: a practical information-theoretic approach
  • [6] ESHELMAN L, 1991, FOGA 1
  • [7] GOLDMAN N, 1994, MOL BIOL EVOL, V11, P725
  • [8] Genetic algorithm-based maximum-likelihood analysis for molecular phylogeny
    Katoh, K
    Kuma, K
    Miyata, T
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 2001, 53 (4-5) : 477 - 484
  • [9] Kim YH, 2003, LECT NOTES COMPUT SC, V2724, P2168
  • [10] ON INFORMATION AND SUFFICIENCY
    KULLBACK, S
    LEIBLER, RA
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1951, 22 (01): : 79 - 86