Predicting protein-protein binding sites in membrane proteins

被引:24
作者
Bordner, Andrew J. [1 ]
机构
[1] Mayo Clin, Scottsdale, AZ 85259 USA
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
SEQUENCE PROFILE; DATA-BANK; IDENTIFICATION; INTERFACES; COMPLEXES; PROGRAM;
D O I
10.1186/1471-2105-10-312
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Many integral membrane proteins, like their non-membrane counterparts, form either transient or permanent multi-subunit complexes in order to carry out their biochemical function. Computational methods that provide structural details of these interactions are needed since, despite their importance, relatively few structures of membrane protein complexes are available. Results: We present a method for predicting which residues are in protein-protein binding sites within the transmembrane regions of membrane proteins. The method uses a Random Forest classifier trained on residue type distributions and evolutionary conservation for individual surface residues, followed by spatial averaging of the residue scores. The prediction accuracy achieved for membrane proteins is comparable to that for non-membrane proteins. Also, like previous results for non-membrane proteins, the accuracy is significantly higher for residues distant from the binding site boundary. Furthermore, a predictor trained on non-membrane proteins was found to yield poor accuracy on membrane proteins, as expected from the different distribution of surface residue types between the two classes of proteins. Thus, although the same procedure can be used to predict binding sites in membrane and non-membrane proteins, separate predictors trained on each class of proteins are required. Finally, the contribution of each residue property to the overall prediction accuracy is analyzed and prediction examples are discussed. Conclusion: Given a membrane protein structure and a multiple alignment of related sequences, the presented method gives a prioritized list of which surface residues participate in intramembrane protein-protein interactions. The method has potential applications in guiding the experimental verification of membrane protein interactions, structure-based drug discovery, and also in constraining the search space for computational methods, such as protein docking or threading, that predict membrane protein complex structures.
引用
收藏
页数:10
相关论文
共 37 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Properties and identification of human protein drug targets [J].
Bakheet, Tala M. ;
Doig, Andrew J. .
BIOINFORMATICS, 2009, 25 (04) :451-457
[3]   Statistical analysis and prediction of protein-protein interfaces [J].
Bordner, AJ ;
Abagyan, R .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 60 (03) :353-366
[4]   REVCOM: a robust Bayesian method for evolutionary rate estimation [J].
Bordner, AJ ;
Abagyan, R .
BIOINFORMATICS, 2005, 21 (10) :2315-2321
[5]   Predicting small ligand binding sites in proteins using backbone structure [J].
Bordner, Andrew J. .
BIOINFORMATICS, 2008, 24 (24) :2865-2871
[6]   Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces [J].
Bordner, Andrew J. ;
Gorin, Andrey A. .
BMC BIOINFORMATICS, 2008, 9 (1)
[7]   Improved prediction of protein-protein binding sites using a support vector machines approach [J].
Bradford, JR ;
Westhead, DR .
BIOINFORMATICS, 2005, 21 (08) :1487-1494
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]   Predicting protein interaction sites: binding hot-spots in protein-protein and protein-ligand interfaces [J].
Burgoyne, Nicholas J. ;
Jackson, Richard M. .
BIOINFORMATICS, 2006, 22 (11) :1335-1342
[10]   Prediction of interface residues in protein-protein complexes by a consensus neural network method: Test against NMR data [J].
Chen, HL ;
Zhou, HX .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 (01) :21-35