Prediction of interface residues in protein-protein complexes by a consensus neural network method: Test against NMR data

被引:228
作者
Chen, HL
Zhou, HX [1 ]
机构
[1] Florida State Univ, Dept Phys, Tallahassee, FL 32306 USA
[2] Florida State Univ, Inst Mol Biophys, Tallahassee, FL 32306 USA
[3] Florida State Univ, Sch Computat Sci, Tallahassee, FL 32306 USA
[4] Drexel Univ, Dept Phys, Philadelphia, PA 19104 USA
关键词
protein-protein interaction; protein complexes; neural network; interface prediction; protein docking;
D O I
10.1002/prot.20514
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The number of structures of protein-protein complexes deposited to the Protein Data Bank is growing rapidly. These structures embed important information for predicting structures of new protein complexes. This motivated us to develop the PPISP method for predicting interface residues in protein-protein complexes. In PPISP, sequence profiles and solvent accessibility of spatially neighboring surface residues were used as input to a neural network. The network was trained on native interface residues collected from the Protein Data Bank. The prediction accuracy at the time was 70% with 47% coverage of native interface residues. Now we have extensively improved PPISP. The training set now consisted of 1156 nonhomologous protein chains. Test on a set of 100 nonhomologous protein chains showed that the prediction accuracy is now increased to 80% with 51% coverage. To solve the problem of over-prediction and under-prediction associated with individual neural network models, we developed a consensus method that combines predictions from multiple models with different levels of accuracy and coverage. Applied on a benchmark set of 68 proteins for protein protein docking, the consensus approach outperformed the best individual models by 3-8 percentage points in accuracy. To demonstrate the predictive power of cons-PPISP, eight complex-forming proteins with interfaces characterized by NMR were tested. These proteins are nonhomologous to the training set and have a total of 144 interface residues identified by chemical shift perturbation. cons-PPISP predicted 174 interface residues with 69% accuracy and 47% coverage and promises to complement experimental techniques in characterizing protein-protein interfaces.
引用
收藏
页码:21 / 35
页数:15
相关论文
共 68 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] ConSurf: An algorithmic tool for the identification of functional regions in proteins by surface mapping of phylogenetic information
    Armon, A
    Graur, D
    Ben-Tal, N
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (01) : 447 - 463
  • [3] Structure of a flavivirus envelope glycoprotein in its low-pH-induced membrane fusion conformation
    Bressanelli, S
    Stiasny, K
    Allison, SL
    Stura, EA
    Duquerroy, S
    Lescar, J
    Heinz, FX
    Rey, FA
    [J]. EMBO JOURNAL, 2004, 23 (04) : 728 - 738
  • [4] Cellulosome assembly revealed by the crystal structure of the cohesin-dockerin complex
    Carvalho, AL
    Dias, FMV
    Prates, JAM
    Nagy, T
    Gilbert, HJ
    Davies, GJ
    Ferreira, LMA
    Romao, MJ
    Fontes, CMGA
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (24) : 13809 - 13814
  • [5] A protein-protein docking benchmark
    Chen, R
    Mintseris, J
    Janin, J
    Weng, ZP
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2003, 52 (01): : 88 - 91
  • [7] HADDOCK: A protein-protein docking approach based on biochemical or biophysical information
    Dominguez, C
    Boelens, R
    Bonvin, AMJJ
    [J]. JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2003, 125 (07) : 1731 - 1737
  • [8] Structural basis of the interaction between the AAA ATPase p97/VCP and its adaptor protein p47
    Dreveny, I
    Kondo, H
    Uchiyama, K
    Shaw, A
    Zhang, XD
    Freemont, PS
    [J]. EMBO JOURNAL, 2004, 23 (05) : 1030 - 1039
  • [9] Insight into the PrPC → PrPSc conversion from the structures of antibody-bound ovine prion scrapie-susceptibility variants
    Eghiaian, F
    Grosclaude, J
    Lesceu, S
    Debey, P
    Doublet, B
    Tréguer, E
    Rezaei, H
    Knossow, M
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (28) : 10254 - 10259
  • [10] Prediction of protein-protein interaction sites in heterocomplexes with neural networks
    Fariselli, P
    Pazos, F
    Valencia, A
    Casadio, R
    [J]. EUROPEAN JOURNAL OF BIOCHEMISTRY, 2002, 269 (05): : 1356 - 1361