Quantifying the impact of protein tertiary structure on molecular evolution

被引:39
作者
Choi, Sang Chul [1 ]
Hobolth, Asger
Robinson, Douglas M.
Kishino, Hirohisa
Thorne, Jeffrey L.
机构
[1] N Carolina State Univ, Bioinformat Res Ctr, Raleigh, NC 27695 USA
[2] Bristol Myers Squibb Co, Stat Genet & Biomarkers Grp, Pennington, NJ USA
[3] Univ Tokyo, Grad Sch Agr & Life Sci, Lab Biometr, Tokyo, Japan
[4] Wissenschaftskolleg Berlin, Berlin, Germany
关键词
molecular evolution; protein structure impact; Gene Ontology; MCMC; Bayes factor;
D O I
10.1093/molbev/msm097
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
To investigate the evolutionary impact of protein structure, the experimentally determined tertiary structure and the protein-coding DNA sequence were collected for each of 1, 195 genes. These genes were studied via a model of sequence change that explicitly incorporates effects on evolutionary rates due to protein tertiary structure. In the model, these effects act via the solvent accessibility environments and pairwise amino acid interactions that are induced by tertiary structure. To compare the hypotheses that structure does and does not have a strong influence on evolution, Bayes factors were estimated for each of the 1,195 sequences. Most of the Bayes factors strongly support the hypothesis that protein structure affects protein evolution. Furthermore, both solvent accessibility and pairwise interactions among amino acids are inferred to have important roles in protein evolution. Our results also indicate that the strength of the relationship between tertiary structure and evolution has a weak but real correlation to the annotation information in the Gene Ontology database. Although their influences on rates of evolution vary among protein families, we find that the mean impacts of solvent accessibility and pairwise interactions are about the same.
引用
收藏
页码:1769 / 1782
页数:14
相关论文
共 53 条
[41]   Site interdependence attributed to tertiary structure in amino acid sequence evolution [J].
Rodrigue, N ;
Lartillot, N ;
Bryant, D ;
Philippe, H .
GENE, 2005, 347 (02) :207-217
[42]   Assessing site-interdependent phylogenetic models of sequence evolution [J].
Rodrigue, Nicolas ;
Philippe, Herve ;
Lartillot, Nicolas .
MOLECULAR BIOLOGY AND EVOLUTION, 2006, 23 (09) :1762-1775
[43]   Recognition of analogous and homologous protein folds: Analysis of sequence and structure conservation [J].
Russell, RB ;
Saqi, MAS ;
Sayle, RA ;
Bates, PA ;
Sternberg, MJE .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 269 (03) :423-439
[44]   Phylogenetic estimation of context-dependent substitution rates by maximum likelihood [J].
Siepel, A ;
Haussler, D .
MOLECULAR BIOLOGY AND EVOLUTION, 2004, 21 (03) :468-488
[45]  
Stein C., 1956, Proceedings of the Third Berkeley symposium on mathematical statistics and probability, V1, P197, DOI DOI 10.1525/9780520313880-018
[46]   Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages [J].
Yang, ZH ;
Nielsen, R .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (06) :908-917
[47]  
Yang ZH, 2000, GENETICS, V155, P431
[48]   MAXIMUM-LIKELIHOOD PHYLOGENETIC ESTIMATION FROM DNA-SEQUENCES WITH VARIABLE RATES OVER SITES - APPROXIMATE METHODS [J].
YANG, ZH .
JOURNAL OF MOLECULAR EVOLUTION, 1994, 39 (03) :306-314
[49]   Synonymous and nonsynonymous rate variation in nuclear genes of mammals [J].
Yang, ZH ;
Nielsen, R .
JOURNAL OF MOLECULAR EVOLUTION, 1998, 46 (04) :409-418
[50]   Codon-substitution models to detect adaptive evolution that account for heterogeneous selective pressures among site classes [J].
Yang, ZH ;
Swanson, WJ .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (01) :49-57