Large-scale prediction of protein-protein interactions from structures

被引:71
作者
Hue, Martial [2 ,3 ,4 ]
Riffle, Michael [1 ,5 ]
Vert, Jean-Philippe [2 ,3 ,4 ]
Noble, William S. [1 ,6 ]
机构
[1] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[2] Mines ParisTech, Ctr Computat Biol, F-77305 Fontainebleau, France
[3] Inst Curie, F-75248 Paris, France
[4] INSERM, U900, F-75248 Paris, France
[5] Univ Washington, Dept Biochem, Seattle, WA 98195 USA
[6] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
ALIGNMENT; DATABASE; KERNEL;
D O I
10.1186/1471-2105-11-144
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The prediction of protein-protein interactions is an important step toward the elucidation of protein functions and the understanding of the molecular mechanisms inside the cell. While experimental methods for identifying these interactions remain costly and often noisy, the increasing quantity of solved 3D protein structures suggests that in silico methods to predict interactions between two protein structures will play an increasingly important role in screening candidate interacting pairs. Approaches using the knowledge of the structure are presumably more accurate than those based on sequence only. Approaches based on docking protein structures solve a variant of this problem, but these methods remain very computationally intensive and will not scale in the near future to the detection of interactions at the level of an interactome, involving millions of candidate pairs of proteins. Results: Here, we describe a computational method to predict efficiently in silico whether two protein structures interact. This yes/no question is presumably easier to answer than the standard protein docking question, "How do these two protein structures interact?" Our approach is to discriminate between interacting and non-interacting protein pairs using a statistical pattern recognition method known as a support vector machine (SVM). We demonstrate that our structure-based method performs well on this task and scales well to the size of an interactome. Conclusions: The use of structure information for the prediction of protein interaction yields significantly better performance than other sequence-based methods. Among structure-based classifiers, the SVM algorithm, combined with the metric learning pairwise kernel and the MAMMOTH kernel, performs best in our experiments.
引用
收藏
页数:9
相关论文
共 31 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Choosing negative examples for the prediction of protein-protein interactions [J].
Ben-Hur, A ;
Noble, WS .
BMC BIOINFORMATICS, 2006, 7 (Suppl 1)
[3]   Kernel methods for predicting protein-protein interactions [J].
Ben-Hur, A ;
Noble, WS .
BIOINFORMATICS, 2005, 21 :I38-I46
[4]   Predicting protein-protein interactions from primary structure [J].
Bock, JR ;
Gough, DA .
BIOINFORMATICS, 2001, 17 (05) :455-460
[5]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[6]   Prediction of protein-protein interactions using random decision forest framework [J].
Chen, XW ;
Liu, M .
BIOINFORMATICS, 2005, 21 (24) :4394-4400
[7]  
Davis J., 2006, P INT C MACH LEARN
[8]   Protein interactions - Two methods for assessment of the reliability of high throughput observations [J].
Deane, CM ;
Salwinski, L ;
Xenarios, I ;
Eisenberg, D .
MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (05) :349-356
[9]  
Dohkan Shinsuke, 2006, In Silico Biology, V6, P515
[10]   Learning to predict protein-protein interactions from protein sequences [J].
Gomez, SM ;
Noble, WS ;
Rzhetsky, A .
BIOINFORMATICS, 2003, 19 (15) :1875-1881