Site interdependence attributed to tertiary structure in amino acid sequence evolution

被引:71
作者
Rodrigue, N
Lartillot, N
Bryant, D
Philippe, H
机构
[1] Univ Montreal, Canadian Inst Adv Res, Dept Biochim, Montreal, PQ H3C 3J7, Canada
[2] Lab Informat Robot & Microelect Montpellier, Montpellier, France
[3] McGill Univ, McGill Ctr Bioinformat, Montreal, PQ, Canada
关键词
protein evolution; phylogenetics; Bayesian Markov chain Monte Carlo; statistical potentials;
D O I
10.1016/j.gene.2004.12.011
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Standard likelihood-based frameworks in phylogenetics consider the process of evolution of a sequence site by site. Assuming that sites evolve independently greatly simplifies the required calculations. However, this simplification is known to be incorrect in many cases. Here, a computational method that allows for general dependence between sites of a sequence is investigated. Using this method, measures acting as sequence fitness proxies can be considered over a phylogenetic tree. In this work, a set of statistically derived amino acid pairwise potentials, developed in the context of protein threading, is used to account for what we call the structural fitness of a sequence. We describe a model combining statistical potentials with an empirical amino acid substitution matrix. We propose such a combination as a useful way of capturing the complexity of protein evolution. Finally, we outline features of the model using three datasets and show the approach's sensitivity to different tree topologies. (c) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:207 / 217
页数:11
相关论文
共 36 条
[31]   Protein evolution with dependence among codons due to tertiary structure [J].
Robinson, DM ;
Jones, DT ;
Kishino, H ;
Goldman, N ;
Thorne, JL .
MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (10) :1692-1704
[32]   Phylogenetic estimation of context-dependent substitution rates by maximum likelihood [J].
Siepel, A ;
Haussler, D .
MOLECULAR BIOLOGY AND EVOLUTION, 2004, 21 (03) :468-488
[33]  
Singh B., 2001, Animal Nutrition and Feed Technology, V1, P3
[34]   Combining protein evolution and secondary structure [J].
Thorne, JL ;
Goldman, N ;
Jones, DT .
MOLECULAR BIOLOGY AND EVOLUTION, 1996, 13 (05) :666-673
[35]  
YANG ZH, 1993, MOL BIOL EVOL, V10, P1396
[36]   MAXIMUM-LIKELIHOOD PHYLOGENETIC ESTIMATION FROM DNA-SEQUENCES WITH VARIABLE RATES OVER SITES - APPROXIMATE METHODS [J].
YANG, ZH .
JOURNAL OF MOLECULAR EVOLUTION, 1994, 39 (03) :306-314