Prediction of interface residues in protein-protein complexes by a consensus neural network method: Test against NMR data

被引:229
作者
Chen, HL
Zhou, HX [1 ]
机构
[1] Florida State Univ, Dept Phys, Tallahassee, FL 32306 USA
[2] Florida State Univ, Inst Mol Biophys, Tallahassee, FL 32306 USA
[3] Florida State Univ, Sch Computat Sci, Tallahassee, FL 32306 USA
[4] Drexel Univ, Dept Phys, Philadelphia, PA 19104 USA
关键词
protein-protein interaction; protein complexes; neural network; interface prediction; protein docking;
D O I
10.1002/prot.20514
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The number of structures of protein-protein complexes deposited to the Protein Data Bank is growing rapidly. These structures embed important information for predicting structures of new protein complexes. This motivated us to develop the PPISP method for predicting interface residues in protein-protein complexes. In PPISP, sequence profiles and solvent accessibility of spatially neighboring surface residues were used as input to a neural network. The network was trained on native interface residues collected from the Protein Data Bank. The prediction accuracy at the time was 70% with 47% coverage of native interface residues. Now we have extensively improved PPISP. The training set now consisted of 1156 nonhomologous protein chains. Test on a set of 100 nonhomologous protein chains showed that the prediction accuracy is now increased to 80% with 51% coverage. To solve the problem of over-prediction and under-prediction associated with individual neural network models, we developed a consensus method that combines predictions from multiple models with different levels of accuracy and coverage. Applied on a benchmark set of 68 proteins for protein protein docking, the consensus approach outperformed the best individual models by 3-8 percentage points in accuracy. To demonstrate the predictive power of cons-PPISP, eight complex-forming proteins with interfaces characterized by NMR were tested. These proteins are nonhomologous to the training set and have a total of 144 interface residues identified by chemical shift perturbation. cons-PPISP predicted 174 interface residues with 69% accuracy and 47% coverage and promises to complement experimental techniques in characterizing protein-protein interfaces.
引用
收藏
页码:21 / 35
页数:15
相关论文
共 68 条
[51]   Prediction and confirmation of a site critical for effector regulation of RGS domain activity [J].
Sowa, ME ;
He, W ;
Slep, KC ;
Kercher, MA ;
Lichtarge, O ;
Wensel, TG .
NATURE STRUCTURAL BIOLOGY, 2001, 8 (03) :234-237
[52]  
Takahashi H, 2000, NAT STRUCT BIOL, V7, P220
[53]   Structure of the core domain of human cardiac troponin in the Ca2+-saturated form [J].
Takeda, S ;
Yamashita, A ;
Maeda, K ;
Maéda, Y .
NATURE, 2003, 424 (6944) :35-41
[54]   Role of an N-terminal site of Ubc9 in SUMO-1,-2, and-3 binding and conjugation [J].
Tatham, MH ;
Kim, S ;
Yu, B ;
Jaffray, E ;
Song, J ;
Zheng, J ;
Rodriguez, MS ;
Hay, RT ;
Chen, Y .
BIOCHEMISTRY, 2003, 42 (33) :9959-9969
[55]   Structural basis of protein phosphatase 1 regulation [J].
Terrak, M ;
Kerff, F ;
Langsetmo, K ;
Tao, T ;
Dominguez, R .
NATURE, 2004, 429 (6993) :780-784
[56]   A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae [J].
Uetz, P ;
Giot, L ;
Cagney, G ;
Mansfield, TA ;
Judson, RS ;
Knight, JR ;
Lockshon, D ;
Narayan, V ;
Srinivasan, M ;
Pochart, P ;
Qureshi-Emili, A ;
Li, Y ;
Godwin, B ;
Conover, D ;
Kalbfleisch, T ;
Vijayadamodar, G ;
Yang, MJ ;
Johnston, M ;
Fields, S ;
Rothberg, JM .
NATURE, 2000, 403 (6770) :623-627
[57]  
Valdar WSJ, 2001, PROTEINS, V42, P108
[58]   Data-driven docking: HADDOCK's adventures in CAPRI [J].
van Dijk, ADJ ;
de Vries, SJ ;
Dominguez, C ;
Chen, H ;
Zhou, HX ;
Bonvin, AMJJ .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 60 (02) :232-238
[59]   Crystal structure of troponin C in complex with troponin I fragment at 2.3-Å resolution [J].
Vassylyev, DG ;
Takeda, S ;
Wakatsuki, S ;
Maeda, K ;
Maéda, Y .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (09) :4847-4852
[60]   COMPUTER-ANALYSIS OF PROTEIN-PROTEIN INTERACTION [J].
WODAK, SJ ;
JANIN, J .
JOURNAL OF MOLECULAR BIOLOGY, 1978, 124 (02) :323-342