Testing for spatial clustering of amino acid replacements within protein tertiary structure

被引:11
作者
Yu, Jiaye [1 ]
Thorne, Jeffrey L. [1 ]
机构
[1] N Carolina State Univ, Bioinformat Res Ctr, Raleigh, NC 27695 USA
关键词
protein tertiary structure; protein evolution; spatial clustering;
D O I
10.1007/s00239-005-0107-2
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Widely used models of protein evolution ignore protein structure. Therefore, these models do not predict spatial clustering of amino acid replacements with respect to tertiary structure. One formal and biologically implausible possibility is that there is no tendency for amino acid replacements to be spatially clustered during evolution. An alternative to this is that amino acid replacements are spatially clustered and this spatial clustering can be fully explained by a tendency for similar rates of amino acid replacement at sites that are nearby in protein tertiary structure. A third possibility is that the amount of clustering exceeds that which can be explained solely on the basis of independently evolving protein sites with spatially clustered replacement rates. We introduce two simple and not very parametric hypothesis tests that help distinguish these three possibilities. We then apply these tests to 273 homologous protein families. The null hypothesis of no spatial clustering is rejected for 102 of 273 families. The explanation of spatially clustered rates but independent change among sites is rejected for 43 families. These findings need to be reconciled with the common practice of basing evolutionary inferences on models that assume independent change among sites.
引用
收藏
页码:682 / 692
页数:11
相关论文
共 39 条
[21]  
MOOD AM, 1974, INTRO THOERY STAT
[22]  
MUSE SV, 1994, MOL BIOL EVOL, V11, P715
[23]   ANALYSIS OF MUTATIONS IN THE TRANSMEMBRANE REGION OF THE ASPARTATE CHEMORECEPTOR IN ESCHERICHIA-COLI [J].
OOSAWA, K ;
SIMON, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1986, 83 (18) :6930-6934
[24]   A dependent-rates model and an MCMC-based methodology for the maximum-likelihood analysis of sequences with overlapping reading frames [J].
Pedersen, AMK ;
Jensen, JL .
MOLECULAR BIOLOGY AND EVOLUTION, 2001, 18 (05) :763-776
[25]   Coevolving protein residues: Maximum likelihood identification and relationship to structure [J].
Pollock, DD ;
Taylor, WR ;
Goldman, N .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 287 (01) :187-198
[26]   Evaluation of a novel method for the identification of coevolving protein residues [J].
Pritchard, L ;
Bladon, P ;
Mitchell, JMO ;
Dufton, MJ .
PROTEIN ENGINEERING, 2001, 14 (08) :549-555
[27]   A fast algorithm for joint reconstruction of ancestral amino acid sequences [J].
Pupko, T ;
Pe'er, I ;
Shamir, R ;
Graur, D .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (06) :890-896
[28]   A branch-and-bound algorithm for the inference of ancestral amino-acid sequences when the replacement rate varies among sites: Application to the evolution of five gene families [J].
Pupko, T ;
Pe'er, I ;
Hasegawa, M ;
Graur, D ;
Friedman, N .
BIOINFORMATICS, 2002, 18 (08) :1116-1123
[29]   Protein evolution with dependence among codons due to tertiary structure [J].
Robinson, DM ;
Jones, DT ;
Kishino, H ;
Goldman, N ;
Thorne, JL .
MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (10) :1692-1704
[30]   Site interdependence attributed to tertiary structure in amino acid sequence evolution [J].
Rodrigue, N ;
Lartillot, N ;
Bryant, D ;
Philippe, H .
GENE, 2005, 347 (02) :207-217