Functional specificity lies within the properties and evolutionary changes of amino acids

Cited by: 51
Authors
Chakrabarti, Saikat [1 ]
Bryant, Stephen H. [1 ]
Panchenko, Anna R. [1 ]
Affiliations
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, Natl Inst Hlth, Bethesda, MD 20894 USA
Funding
US National Institutes of Health;
Keywords
functional divergence; subfamily specificity; physico-chemical properties; combined relative entropy; evolutionary rate;
DOI
10.1016/j.jmb.2007.08.036
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
The rapid increase in the amount of protein sequence data has created a need for automated identification of sites that determine functional specificity among related subfamilies of proteins. A significant fraction of subfamily-specific sites are only marginally conserved, which makes it extremely challenging to detect those amino acid changes that lead to functional diversification. To address this critical problem, we developed a method named SPEER (specificity prediction using amino acids' properties, entropy and evolution rate) to distinguish specificity-determining sites from others. SPEER encodes the conservation patterns of amino acid types using their physico-chemical properties and the heterogeneity of evolutionary changes between and within the subfamilies. To test the method, we compiled a test set containing 13 protein families with known specificity-determining sites. Extensive benchmarking, comparing the performance of SPEER with other specificity site prediction algorithms, has shown that it performs better in predicting several categories of subfamily-specific sites. Published by Elsevier Ltd.
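The abstract and keywords name relative entropy as one ingredient of the score. As an illustration only, the sketch below computes a symmetric relative entropy between the amino acid distributions of two subfamilies at a single alignment column. It is not the SPEER scoring function: the abstract does not give the formula, and the physico-chemical and evolutionary-rate terms are omitted. The function names, the uniform pseudocount, and the 20-letter alphabet are all assumptions made for this example.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # assumed alphabet; gaps and other symbols are ignored


def column_distribution(residues, pseudocount=1.0):
    """Smoothed amino acid frequencies for one subfamily at one column.

    The uniform pseudocount is an assumption of this sketch; it keeps every
    probability non-zero so the logarithms below are defined.
    """
    counts = Counter(r for r in residues if r in AMINO_ACIDS)
    total = sum(counts.values()) + pseudocount * len(AMINO_ACIDS)
    return {aa: (counts[aa] + pseudocount) / total for aa in AMINO_ACIDS}


def relative_entropy(p, q):
    """Kullback-Leibler divergence D(p || q) in nats."""
    return sum(p[aa] * math.log(p[aa] / q[aa]) for aa in AMINO_ACIDS)


def symmetric_relative_entropy(col_a, col_b, pseudocount=1.0):
    """D(p || q) + D(q || p) for the residues two subfamilies show at a column."""
    p = column_distribution(col_a, pseudocount)
    q = column_distribution(col_b, pseudocount)
    return relative_entropy(p, q) + relative_entropy(q, p)


if __name__ == "__main__":
    # Hypothetical column: subfamily A is all lysine, subfamily B all glutamate.
    print(symmetric_relative_entropy("KKKK", "EEEE"))
    # The same residues in both subfamilies give no between-subfamily signal.
    print(symmetric_relative_entropy("KKKK", "KKKK"))
```

A column that is conserved within each subfamily but differs between them scores high, which is the pattern a specificity-determining site is expected to show; a column that looks the same in both subfamilies scores zero.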
Pages: 801-810
Page count: 10