Choosing appropriate substitution models for the phylogenetic analysis of protein-coding sequences

被引:577
作者
Shapiro, B [1 ]
Rambaut, A [1 ]
Drummond, AJ [1 ]
机构
[1] Univ Oxford, Dept Zool, Oxford OX1 3PS, England
基金
英国惠康基金;
关键词
phylogenetic inference; protein-coding sequences; substitution models;
D O I
10.1093/molbev/msj021
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Although phylogenetic inference of protein-coding sequences continues to dominate the literature, few analyses incorporate evolutionary models that consider the genetic code. This problem is exacerbated by the exclusion of codon-based models from commonly employed model selection techniques, presumably due to the computational cost associated with codon models. We investigated an efficient alternative to standard nucleotide substitution models, in which codon position (CP) is incorporated into the model. We determined the most appropriate model for alignments of 177 RNA virus genes and 106 yeast genes, using 11 substitution models including one codon model and four CP models. The majority of analyzed gene alignments are best described by CP substitution models, rather than by standard nucleotide models, and without the computational cost of full codon models. These results have significant implications for phylogenetic inference of coding sequences as they make it clear that substitution models incorporating CPs not only are a computationally realistic alternative to standard models but may also frequently be statistically superior.
引用
收藏
页码:7 / 9
页数:3
相关论文
共 22 条
[1]   Topological bias and inconsistency of maximum likelihood using wrong models [J].
Bruno, WJ ;
Halpern, AL .
MOLECULAR BIOLOGY AND EVOLUTION, 1999, 16 (04) :564-566
[2]   The effects of nucleotide substitution model assumptions on estimates of nonparametric bootstrap support [J].
Buckley, TR ;
Cunningham, CW .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (04) :394-405
[3]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[4]   The troubled growth of statistical phylogenetics [J].
Felsenstein, J .
SYSTEMATIC BIOLOGY, 2001, 50 (04) :465-467
[5]   STATISTICAL TESTS OF MODELS OF DNA SUBSTITUTION [J].
GOLDMAN, N .
JOURNAL OF MOLECULAR EVOLUTION, 1993, 36 (02) :182-198
[6]  
GOLDMAN N, 1994, MOL BIOL EVOL, V11, P725
[7]   DATING OF THE HUMAN APE SPLITTING BY A MOLECULAR CLOCK OF MITOCHONDRIAL-DNA [J].
HASEGAWA, M ;
KISHINO, H ;
YANO, TA .
JOURNAL OF MOLECULAR EVOLUTION, 1985, 22 (02) :160-174
[8]   MRBAYES: Bayesian inference of phylogenetic trees [J].
Huelsenbeck, JP ;
Ronquist, F .
BIOINFORMATICS, 2001, 17 (08) :754-755
[9]   A NEW METHOD FOR CALCULATING EVOLUTIONARY SUBSTITUTION RATES [J].
LANAVE, C ;
PREPARATA, G ;
SACCONE, C ;
SERIO, G .
JOURNAL OF MOLECULAR EVOLUTION, 1984, 20 (01) :86-93
[10]  
MUSE SV, 1994, MOL BIOL EVOL, V11, P715