Improvement in protein functional site prediction by distinguishing structural and functional constraints on protein family evolution using computational design

被引:63
作者
Cheng, G
Qian, B
Samudrala, R
Baker, D [1 ]
机构
[1] Univ Washington, Howard Hughes Med Inst, Dept Biochem, Seattle, WA 98195 USA
[2] Univ Washington, Howard Hughes Med Inst, Biomol Struct & Design Program, Seattle, WA 98195 USA
[3] Univ Washington, Dept Microbiol, Seattle, WA 98195 USA
关键词
D O I
10.1093/nar/gki894
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The prediction of functional sites in newly solved protein structures is a challenge for computational structural biology. Most methods for approaching this problem use evolutionary conservation as the primary indicator of the location of functional sites. However, sequence conservation reflects not only evolutionary selection at functional sites to maintain protein function, but also selection throughout the protein to maintain the stability of the folded state. To disentangle sequence conservation due to protein functional constraints from sequence conservation due to protein structural constraints, we use all atom computational protein design methodology to predict sequence profiles expected under solely structural constraints, and to compute the free energy difference between the naturally occurring amino acid and the lowest free energy amino acid at each position. We show that functional sites are more likely than non-functional sites to have computed sequence profiles which differ significantly from the naturally occurring sequence profiles and to have residues with sub-optimal free energies, and that incorporation of these two measures improves sequence based prediction of protein functional sites. The combined sequence and structure based functional site prediction method has been implemented in a publicly available web server.
引用
收藏
页码:5861 / 5867
页数:7
相关论文
共 28 条
[1]   ConSurf: An algorithmic tool for the identification of functional regions in proteins by surface mapping of phylogenetic information [J].
Armon, A ;
Graur, D ;
Ben-Tal, N .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (01) :447-463
[2]   Structural bases of stability-function tradeoffs in enzymes [J].
Beadle, BM ;
Shoichet, BK .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 321 (02) :285-296
[3]   GenBank [J].
Benson, DA ;
Karsch-Mizrachi, I ;
Lipman, DJ ;
Ostell, J ;
Wheeler, DL .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D34-D38
[4]   Distinguishing structural and functional restraints in evolution in order to identify interaction sites [J].
Chelliah, V ;
Chen, L ;
Blundell, TL ;
Lovell, SC .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 342 (05) :1487-1504
[5]   De novo protein design: Fully automated sequence selection [J].
Dahiyat, BI ;
Mayo, SL .
SCIENCE, 1997, 278 (5335) :82-87
[6]  
Delano WL, 2002, PYMOL USERS MANUAL
[7]   MUSCLE: a multiple sequence alignment method with reduced time and space complexity [J].
Edgar, RC .
BMC BIOINFORMATICS, 2004, 5 (1) :1-19
[8]   Prediction of functionally important residues based solely on the computed energetics of protein structure [J].
Elcock, AH .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 312 (04) :885-896
[9]   Predicting changes in the stability of proteins and protein complexes: A study of more than 1000 mutations [J].
Guerois, R ;
Nielsen, JE ;
Serrano, L .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 320 (02) :369-387
[10]   Using a neural network and spatial clustering to predict the location of active sites in enzymes [J].
Gutteridge, A ;
Bartlett, GJ ;
Thornton, JM .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 330 (04) :719-734