Protein-protein interaction based on pairwise similarity

被引:48
作者
Zaki, Nazar [1 ]
Lazarova-Molnar, Sanja [1 ]
El-Hajj, Wassim [2 ]
Campbell, Piers [3 ]
机构
[1] UAE Univ, Coll Informat Technol, Dept Comp Sci, Bioinformat Lab, Al Ain 17551, U Arab Emirates
[2] UAE Univ, Coll Informat Technol, Dept Informat Secur, Al Ain 17551, U Arab Emirates
[3] UAE Univ, Coll Informat Technol, Dept Informat Syst, Al Ain 17551, U Arab Emirates
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
PREDICTION;
D O I
10.1186/1471-2105-10-150
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Protein-protein interaction (PPI) is essential to most biological processes. Abnormal interactions may have implications in a number of neurological syndromes. Given that the association and dissociation of protein molecules is crucial, computational tools capable of effectively identifying PPI are desirable. In this paper, we propose a simple yet effective method to detect PPI based on pairwise similarity and using only the primary structure of the protein. The PPI based on Pairwise Similarity (PPI-PS) method consists of a representation of each protein sequence by a vector of pairwise similarities against large subsequences of amino acids created by a shifting window which passes over concatenated protein training sequences. Each coordinate of this vector is typically the E-value of the Smith-Waterman score. These vectors are then used to compute the kernel matrix which will be exploited in conjunction with support vector machines. Results: To assess the ability of the proposed method to recognize the difference between "interacted" and "non-interacted" proteins pairs, we applied it on different datasets from the available yeast saccharomyces cerevisiae protein interaction. The proposed method achieved reasonable improvement over the existing state-of-the-art methods for PPI prediction. Conclusion: Pairwise similarity score provides a relevant measure of similarity between protein sequences. This similarity incorporates biological knowledge about proteins and it is extremely powerful when combined with support vector machine to predict PPI.
引用
收藏
页数:12
相关论文
共 33 条
[1]  
[Anonymous], INT J BIOMEDICAL SCI
[2]   Prediction of protein-protein interactions using random decision forest framework [J].
Chen, XW ;
Liu, M .
BIOINFORMATICS, 2005, 21 (24) :4394-4400
[3]  
Cristianini N., 2000, INTRO SUPPORT VECTOR
[4]   Inferring domain-domain interactions from protein-protein interactions [J].
Deng, MH ;
Mehta, S ;
Sun, FZ ;
Chen, T .
GENOME RESEARCH, 2002, 12 (10) :1540-1548
[5]  
EDWARD M, 1999, SCIENCE, V285, P751
[6]  
Fields S, 1997, ADV MOLEC BIOL, P3
[7]   Functional organization of the yeast proteome by systematic analysis of protein complexes [J].
Gavin, AC ;
Bösche, M ;
Krause, R ;
Grandi, P ;
Marzioch, M ;
Bauer, A ;
Schultz, J ;
Rick, JM ;
Michon, AM ;
Cruciat, CM ;
Remor, M ;
Höfert, C ;
Schelder, M ;
Brajenovic, M ;
Ruffner, H ;
Merino, A ;
Klein, K ;
Hudak, M ;
Dickson, D ;
Rudi, T ;
Gnau, V ;
Bauch, A ;
Bastuck, S ;
Huhse, B ;
Leutwein, C ;
Heurtier, MA ;
Copley, RR ;
Edelmann, A ;
Querfurth, E ;
Rybin, V ;
Drewes, G ;
Raida, M ;
Bouwmeester, T ;
Bork, P ;
Seraphin, B ;
Kuster, B ;
Neubauer, G ;
Superti-Furga, G .
NATURE, 2002, 415 (6868) :141-147
[8]   POINT: a database for the prediction of protein-protein interactions based on the orthologous interactome [J].
Huang, TW ;
Tien, AC ;
Lee, YCG ;
Huang, WS ;
Lee, YCG ;
Peng, CL ;
Tseng, HH ;
Kao, CY ;
Huang, CYF .
BIOINFORMATICS, 2004, 20 (17) :3273-3276
[9]  
JUWEN S, 2007, NATL ACAD SCI, V11, P4337
[10]   Combining pairwise-sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships [J].
Liao, L ;
Noble, WS .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2003, 10 (06) :857-868