Site interdependence attributed to tertiary structure in amino acid sequence evolution

Cited by: 71
Authors
Rodrigue, N
Lartillot, N
Bryant, D
Philippe, H
Affiliations
[1] Univ Montreal, Canadian Inst Adv Res, Dept Biochim, Montreal, PQ H3C 3J7, Canada
[2] Lab Informat Robot & Microelect Montpellier, Montpellier, France
[3] McGill Univ, McGill Ctr Bioinformat, Montreal, PQ, Canada
Keywords
protein evolution; phylogenetics; Bayesian Markov chain Monte Carlo; statistical potentials
DOI
10.1016/j.gene.2004.12.011
Chinese Library Classification number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Standard likelihood-based frameworks in phylogenetics consider the process of evolution of a sequence site by site. Assuming that sites evolve independently greatly simplifies the required calculations. However, this simplification is known to be incorrect in many cases. Here, a computational method that allows for general dependence between sites of a sequence is investigated. Using this method, measures acting as sequence fitness proxies can be considered over a phylogenetic tree. In this work, a set of statistically derived amino acid pairwise potentials, developed in the context of protein threading, is used to account for what we call the structural fitness of a sequence. We describe a model combining statistical potentials with an empirical amino acid substitution matrix. We propose such a combination as a useful way of capturing the complexity of protein evolution. Finally, we outline features of the model using three datasets and show the approach's sensitivity to different tree topologies. (c) 2004 Elsevier B.V. All rights reserved.
Pages: 207-217
Page count: 11