Modeling the site-specific variation of selection patterns along lineages

被引:120
作者
Guindon, S
Rodrigo, AG
Dyer, KA
Huelsenbeck, JP
机构
[1] Univ Rochester, Dept Biol, Rochester, NY 14627 USA
[2] Univ Auckland, Bioinformat Inst, Auckland, New Zealand
[3] Univ Auckland, Allan Wilson Ctr Mol Ecol & Evolut, Auckland, New Zealand
[4] Univ Calif San Diego, Sect Ecol Behav & Evolut, Div Biol Sci, La Jolla, CA 92093 USA
关键词
positive selection; codon-based model of nucleotide substitutions; phylogeny; maximum likelihood;
D O I
10.1073/pnas.0402177101
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The unambiguous footprint of positive Darwinian selection in protein-coding DNA sequences is revealed by an excess of nonsynonymous substitutions over synonymous substitutions compared with the neutral expectation. Methods for analyzing the patterns of nonsynonymous and synonymous substitutions usually rely on stochastic models in which the selection regime may vary across the sequence but remains constant across lineages for any amino acid position. Despite some work that has relaxed the constraint that selection patterns remain constant over time, no model provides a strong statistical framework to deal with switches between selection processes at individual sites during the course of evolution. This paper describes an approach that allows the site-specific selection process to vary along lineages of a phylogenetic tree. The parameters of the switching model of codon substitution are estimated by using maximum likelihood. The analysis of eight HIV-1 env homologous sequence data sets shows that this model provides a significantly better fit to the data than one that does not take into account switches between selection patterns in the phylogeny at individual sites. We also provide strong evidence that the strength and the frequency of occurrence of selection might not be estimated accurately when the site-specific variation of selection regimes is ignored.
引用
收藏
页码:12957 / 12962
页数:6
相关论文
共 34 条
[1]   The loss of the hemoglobin H2S-binding function in annelids from sulfide-free habitats reveals molecular adaptation driven by Darwinian positive selection [J].
Bailly, X ;
Leroy, R ;
Carney, S ;
Collin, O ;
Zal, F ;
Toulmond, A ;
Jollivet, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (10) :5885-5890
[2]   Positive selection at sites of multiple amino acid replacements since rat-mouse divergence [J].
Bazykin, GA ;
Kondrashov, FA ;
Ogurtsov, AY ;
Sunyaev, S ;
Kondrashov, AS .
NATURE, 2004, 429 (6991) :558-562
[3]   Positive and negative selection in the DAZ gene family [J].
Bielawski, JP ;
Yang, ZH .
MOLECULAR BIOLOGY AND EVOLUTION, 2001, 18 (04) :523-529
[4]   Inferring nonneutral evolution from human-chimp-mouse orthologous gene trios [J].
Clark, AG ;
Glanowski, S ;
Nielsen, R ;
Thomas, PD ;
Kejariwal, A ;
Todd, MA ;
Tanenbaum, DM ;
Civello, D ;
Lu, F ;
Murphy, B ;
Ferriera, S ;
Wang, G ;
Zheng, XG ;
White, TJ ;
Sninsky, JJ ;
Adams, MD ;
Cargill, M .
SCIENCE, 2003, 302 (5652) :1960-1963
[5]   Parallel evolution of drug resistance in HIV: Failure of nonsynonymous/synonymous substitution rate ratio to detect selection [J].
Crandall, KA ;
Kelsey, CR ;
Imamichi, H ;
Lane, HC ;
Salzman, NP .
MOLECULAR BIOLOGY AND EVOLUTION, 1999, 16 (03) :372-382
[6]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[7]   A codon-based model of host-specific selection in parasites, with an application to the influenza A virus [J].
Forsberg, R ;
Christiansen, FB .
MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (08) :1252-1259
[8]  
GOLDMAN N, 1994, MOL BIOL EVOL, V11, P725
[9]   A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood [J].
Guindon, S ;
Gascuel, O .
SYSTEMATIC BIOLOGY, 2003, 52 (05) :696-704
[10]  
Haydon DT, 2001, GENETICS, V157, P7