Exploiting sequence and structure homologs to identify protein-protein binding sites

被引:78
作者
Chung, JL
Wang, W
Bourne, PE
机构
[1] Univ Calif San Diego, San Diego Sumpercomp Ctr, Dept Pharmacol, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
关键词
structurally conserved surface residues; protein interfaces; support vector machines;
D O I
10.1002/prot.20741
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A rapid increase in the number of experimentally derived three-dimensional structures provides an opportunity to better understand and subsequently predict protein-protein interactions. In this study, structurally conserved residues were derived from multiple structure alignments of the individual components of known complexes and the assigned conservation score was weighted based on the crystallographic B factor to account for the structural flexibility that will result in a poor alignment. Sequence profile and accessible surface area information was then combined with the conservation score to predict protein-protein binding sites using a Support Vector Machine (SVM). The incorporation of the conservation score significantly improved the performance of the SVM. About 52% of the binding sites were precisely predicted (greater than 70% of the residues in the site were identified); 77% of the binding sites were correctly predicted (greater than 50% of the residues in the site were identified), and 21% of the binding sites were partially covered by the predicted residues (some residues were identified). The results support the hypothesis that in many cases protein interfaces require some residues to provide rigidity to minimize the entropic cost upon complex formation.
引用
收藏
页码:630 / 640
页数:11
相关论文
共 67 条
[61]   Scoring residue conservation [J].
Valdar, WSJ .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 48 (02) :227-241
[62]  
Vapnik V.N., 1995, NATURE STAT LEARNING, pXV, DOI DOI 10.1007/978-1-4757-2440-0
[63]   ACCURACY OF PROTEIN FLEXIBILITY PREDICTIONS [J].
VIHINEN, M ;
TORKKILA, E ;
RIIKONEN, P .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1994, 19 (02) :141-149
[64]  
WELLS JA, 1991, METHOD ENZYMOL, V202, P390
[65]   Prediction of protein-protein interactions:: the CAPRI experiment, its evaluation and implications [J].
Wodak, SJ ;
Méndez, R .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2004, 14 (02) :242-249
[66]   A two-stage classifier for identification of protein-protein interface residues [J].
Yan, Changhui ;
Dobbs, Drena ;
Honavar, Vasant .
BIOINFORMATICS, 2004, 20 :371-378
[67]   Prediction of protein interaction sites from sequence profile and residue neighbor list [J].
Zhou, HX ;
Shan, YB .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 44 (03) :336-343