High-throughput identification of interacting protein-protein binding sites

Cited by: 10
Authors
Chung, Jo-Lan
Wang, Wei
Bourne, Philip E. [1]
Affiliations
[1] Univ Calif San Diego, Dept Pharmacol, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, San Diego Supercomp Ctr, La Jolla, CA 92093 USA
Source
BMC BIOINFORMATICS | 2007 / Vol. 8
Keywords
SUPPORT VECTOR MACHINES; STATISTICAL-ANALYSIS; CRYSTAL-STRUCTURE; HOT-SPOTS; PREDICTION; INTERFACES; SEQUENCE; INFORMATION; COMPLEXES; DOCKING;
DOI
10.1186/1471-2105-8-223
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: With the advent of increasing sequence and structural data, a number of methods have been proposed to locate putative protein binding sites from protein surfaces. Therefore, methods that are able to identify whether these binding sites interact are needed. Results: We have developed a new method using a machine learning approach to detect whether protein binding sites, once identified, interact with each other. The method exploits information relating to sequence and structural complementarity across protein interfaces and has been tested on a non-redundant data set consisting of 584 homo-dimers and 198 hetero-dimers extracted from the PDB. Results indicate that 87.4% of the interacting binding sites and 68.6% of the non-interacting binding sites were correctly identified. Furthermore, we built a pipeline that links this method to a modified version of our previously developed method that predicts the location of binding sites. Conclusion: We have demonstrated that this high-throughput pipeline is capable of identifying binding sites for proteins, their interacting binding sites and, ultimately, their binding partners on a large scale.
Pages: 12