A Bayesian compound stochastic process for modeling nonstationary and nonhomogeneous sequence evolution

Times cited: 95
Authors
Blanquart, Samuel [1]
Lartillot, Nicolas [1]
Affiliations
[1] CNRS, LIRMM, Projet Methodes & Algorithmes Bioinformat, Montpellier, France
Keywords
phylogeny; MCMC; nonstationary; nonhomogeneous; compositional bias; compound stochastic process
DOI
10.1093/molbev/msl091
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Variations of nucleotidic composition affect phylogenetic inference conducted under stationary models of evolution. In particular, they may cause unrelated taxa sharing similar base composition to be grouped together in the resulting phylogeny. To address this problem, we developed a nonstationary and nonhomogeneous model accounting for compositional biases. Unlike previous nonstationary models, which are branchwise, that is, assume that base composition only changes at the nodes of the tree, in our model, the process of compositional drift is totally uncoupled from the speciation events. In addition, the total number of events of compositional drift distributed across the tree is directly inferred from the data. We implemented the method in a Bayesian framework, relying on Markov Chain Monte Carlo algorithms, and applied it to several nucleotidic data sets. In most cases, the stationarity assumption was rejected in favor of our nonstationary model. In addition, we show that our method is able to resolve a well-known artifact. By Bayes factor evaluation, we compared our model with 2 previously developed nonstationary models. We show that the coupling between speciations and compositional shifts inherent to branchwise models may lead to an overparameterization, resulting in a lesser fit. In some cases, this leads to incorrect conclusions, concerning the nature of the compositional biases. In contrast, our compound model more flexibly adapts its effective number of parameters to the data sets under investigation. Altogether, our results show that accounting for nonstationary sequence evolution may require more elaborate and more flexible models than those currently used.
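Illustrative note (not part of the published abstract): the idea that compositional shifts are uncoupled from speciation events can be sketched as a Poisson process that places breakpoints anywhere along the branches, with a fresh base composition drawn at each breakpoint. The sketch below is a minimal illustration under stated assumptions, not the authors' implementation: the function names `place_breakpoints` and `draw_composition`, the fixed `rate`, the flat Dirichlet prior, and the toy branch lengths are all hypothetical. In the paper the number of shift events is inferred from the data by MCMC rather than simulated forward.

```python
import random


def place_breakpoints(branch_lengths, rate, rng):
    """Place compositional-shift events along each branch.

    Events follow a homogeneous Poisson process with the given rate,
    so they can fall anywhere along a branch, not only at the nodes.
    Returns a dict mapping branch name to sorted event positions.
    (Hypothetical helper; names and parameters are assumptions.)
    """
    events = {}
    for name, length in branch_lengths.items():
        positions = []
        # Waiting times between Poisson events are exponential.
        t = rng.expovariate(rate)
        while t < length:
            positions.append(t)
            t += rng.expovariate(rate)
        events[name] = positions
    return events


def draw_composition(rng, alpha=1.0):
    """Draw base frequencies for A, C, G, T from a symmetric Dirichlet.

    Uses the standard construction: normalize independent gamma draws.
    The flat prior (alpha = 1) is an assumption for illustration.
    """
    draws = [rng.gammavariate(alpha, 1.0) for _ in range(4)]
    total = sum(draws)
    return [d / total for d in draws]


# Toy example: three branches with made-up lengths.
rng = random.Random(42)
toy_branches = {"root-A": 0.30, "root-B": 0.85, "B-C": 0.10}
breakpoints = place_breakpoints(toy_branches, rate=2.0, rng=rng)
for branch, positions in breakpoints.items():
    # Each breakpoint starts a segment with its own base composition.
    compositions = [draw_composition(rng) for _ in positions]
    print(branch, len(positions), "shift event(s)")
```

Because the number of breakpoints on a branch depends only on its length and the rate, a short branch may carry no shift and a long one several, which is the flexibility the abstract contrasts with branchwise models that tie one composition to each branch.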
Pages: 2058-2071
Number of pages: 14