A site- and time-heterogeneous model of amino acid replacement

被引:138
作者
Blanquart, Samuel [1 ]
Lartillot, Nicolas [1 ]
机构
[1] Univ Montpellier 2, Lab Informat Robot & Microelect Montpellier, CNRS, UMR 5506, Montpellier, France
关键词
phylogeny; MCMC; nonstationary; mixture; posterior predictive; model violation; LBA;
D O I
10.1093/molbev/msn018
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We combined the category ( CAT) mixture model ( Lartillot N, Philippe H. 2004) and the nonstationary break point ( BP) model ( Blanquart S, Lartillot N. 2006) into a new model, CAT - BP, accounting for variations of the evolutionary process both along the sequence and across lineages. As in CAT, the model implements a mixture of distinct Markovian processes of substitution distributed among sites, thus accommodating site- specific selective constraints induced by protein structure and function. Furthermore, as in BP, these processes are nonstationary, and their equilibrium frequencies are allowed to change along lineages in a correlated way, through discrete shifts in global amino acid composition distributed along the phylogenetic tree. We implemented the CAT - BP model in a Bayesian Markov Chain Monte Carlo framework and compared its predictions with those of 3 simpler models, BP, CAT, and the site- and timehomogeneous general time - reversible ( GTR) model, on a concatenation of 4 mitochondrial proteins of 20 arthropod species. In contrast to GTR, BP, and CAT, which all display a phylogenetic reconstruction artifact positioning the bees Apis mellifera and Melipona bicolor among chelicerates, the CAT - BP model is able to recover the monophyly of insects. Using posterior predictive tests, we further show that the CAT - BP combination yields better anticipations of site- and taxon- specific amino acid frequencies and that it better accounts for the homoplasies that are responsible for the artifact. Altogether, our results show that the joint modeling of heterogeneities across sites and along time results in a synergistic improvement of the phylogenetic inference, indicating that it is essential to disentangle the combined effects of both sources of heterogeneity, in order to overcome systematic errors in protein phylogenetic analyses.
引用
收藏
页码:842 / 858
页数:17
相关论文
共 70 条
  • [51] Markov chain sampling methods for Dirichlet process mixture
    Neal, RM
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2000, 9 (02) : 249 - 265
  • [52] Nielsen R, 2002, Pac Symp Biocomput, P576
  • [53] Mapping mutations on phylogenies
    Nielsen, R
    [J]. SYSTEMATIC BIOLOGY, 2002, 51 (05) : 729 - 739
  • [54] Protein evolution with dependence among codons due to tertiary structure
    Robinson, DM
    Jones, DT
    Kishino, H
    Goldman, N
    Thorne, JL
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (10) : 1692 - 1704
  • [55] Site interdependence attributed to tertiary structure in amino acid sequence evolution
    Rodrigue, N
    Lartillot, N
    Bryant, D
    Philippe, H
    [J]. GENE, 2005, 347 (02) : 207 - 217
  • [56] THE GENERAL STOCHASTIC-MODEL OF NUCLEOTIDE SUBSTITUTION
    RODRIGUEZ, F
    OLIVER, JL
    MARIN, A
    MEDINA, JR
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 1990, 142 (04) : 485 - 501
  • [57] Detecting and overcoming systematic errors in genome-scale phylogenies
    Rodriguez-Ezpeleta, Naiara
    Brinkmann, Henner
    Roure, Beatrice
    Lartillot, Nicolas
    Lang, B. Franz
    Philippe, Herve
    [J]. SYSTEMATIC BIOLOGY, 2007, 56 (03) : 389 - 399
  • [58] Phylogenomic analysis reveals bees and wasps (Hymenoptera) at the base of the radiation of Holometabolous insects
    Savard, Joel
    Tautz, Diethard
    Richards, Stephen
    Weinstock, George M.
    Gibbs, Richard A.
    Werren, John H.
    Tettelin, Herve
    Lercher, Martin J.
    [J]. GENOME RESEARCH, 2006, 16 (11) : 1334 - 1338
  • [59] Nucleotide bias causes a genomewide bias in the amino acid composition of proteins
    Singer, GAC
    Hickey, DA
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (11) : 1581 - 1588
  • [60] Thermophilic prokaryotes have characteristic patterns of codon usage, amino acid composition and nucleotide content
    Singer, GAC
    Hickey, DA
    [J]. GENE, 2003, 317 (1-2) : 39 - 47