A NEW SUBSTITUTION MATRIX FOR PROTEIN-SEQUENCE SEARCHES BASED ON CONTACT FREQUENCIES IN PROTEIN STRUCTURES

被引:64
作者
MIYAZAWA, S [1 ]
JERNIGAN, RL [1 ]
机构
[1] NCI,BETHESDA,MD 20892
来源
PROTEIN ENGINEERING | 1993年 / 6卷 / 03期
关键词
CONTACT ENERGIES; CONTACT FREQUENCIES; HOMOLOGY SEARCH; SEQUENCE COMPARISON; SUBSTITUTION MATRIX;
D O I
10.1093/protein/6.3.267
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The instabilities of the native structures of mutant proteins with an amino acid exchange are estimated by using the contact energy and the number of contacts for each type of amino acid pair, which were estimated from 18 192 residue - residue contacts observed in 42 crystals of globular proteins. They were then used to evaluate a transition probability matrix of codon substitutions and a log relatedness odds matrix, which is used as a scoring matrix to measure the similarity between protein sequences. To consider amino acid substitutions in homologous proteins, base mutation rates and the effects of the genetic code are also taken into account. The average fitness of an amino acid exchange is approximated to be proportional to the structural stability of the mutant protein, which is then approximated by the average energy change of the protein native structure expected for the amino acid exchange with neglect of the energy change of the denatured state. In global and local homology searches, this scoring matrix tends to yield significantly higher alignment scores than either the unitary matrix or the genetic code matrix, and also may yield higher alignment scores for distantly related protein pairs than MDM78. One of advantages of this scoring matrix is that the equilibrium frequencies of codons and also base mutation rates can be adjusted.
引用
收藏
页码:267 / 278
页数:12
相关论文
共 20 条