Different versions of the Dayhoff rate matrix

被引:100
作者
Kosiol, C [1 ]
Goldman, N [1 ]
机构
[1] EMBL, European Bioinformat Inst, Hinxton, England
关键词
amino acid replacement; Dayhoff matrix; Markov models; phylogenetic inference; protein evolution;
D O I
10.1093/molbev/msi005
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Many phylogenetic inference methods are based on Markov models of sequence evolution. These are usually expressed in terms of a matrix (Q) of instantaneous rates of change but some models of amino acid replacement. most notably the PAM model of Dayhoff and colleagues, were originally published only in terms of time-dependent probability matrices (P(t)). Previously published methods for deriving Q have used eigen-decomposition of an approximation to P(t). We show that the commonly used value of t is too large to ensure convergence of the estimates of elements of Q. We describe two simpler alternative methods for deriving Q from information such as that published by Dayhoff and colleagues. Neither of these methods requires approximation or eigen-decomposition. We identify the methods used to derive various different versions of the Dayhoff model in current software. perform a comparison of existing and new implementations, and, to facilitate agreement among scientists using supposedly identical models, recommend that one of the new methods be used as a standard.
引用
收藏
页码:193 / 199
页数:7
相关论文
共 35 条
  • [1] Plastid genome phylogeny and a model of amino acid substitution for proteins encoded by chloroplast DNA
    Adachi, J
    Waddell, PJ
    Martin, W
    Hasegawa, M
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 2000, 50 (04) : 348 - 358
  • [2] Adachi J, 1996, J MOL EVOL, V42, P459
  • [3] ADACHI J, 1992, MOLPHY VERSION 2 3 P
  • [4] [Anonymous], 1972, ATLAS PROTEIN SEQUEN
  • [5] [Anonymous], 1978, Atlas of protein sequence and structure
  • [6] Balding D., 2003, HDB STAT GENETICS, P209
  • [7] CAO Y, 1994, J MOL EVOL, V39, P519
  • [8] Rate matrices for analyzing large families of protein sequences
    Devauchelle, C
    Grossmann, A
    Hénaut, A
    Holschneider, M
    Monnerot, M
    Risler, JL
    Torrésani, B
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (04) : 381 - 399
  • [9] rtREV: An amino acid substitution matrix for inference of retrovirus and reverse transcriptase phylogeny
    Dimmic, MW
    Rest, JS
    Mindell, DP
    Goldstein, RA
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 2002, 55 (01) : 65 - 73
  • [10] Felsenstein J, 1996, METHOD ENZYMOL, V266, P418