The Development of a Universal In Silico Predictor of Protein-Protein Interactions

被引:28
作者
Valente, Guilherme T. [1 ]
Acencio, Marcio L. [2 ]
Martins, Cesar [1 ]
Lemke, Ney [2 ]
机构
[1] Univ Estadual Paulista, UNESP, Dept Morphol, Botucatu, SP, Brazil
[2] Univ Estadual Paulista, UNESP, Dept Phys & Biophys, Botucatu, SP, Brazil
来源
PLOS ONE | 2013年 / 8卷 / 05期
基金
巴西圣保罗研究基金会;
关键词
LINK-PREDICTION; INTEGRATION; PRINCIPLES; NETWORKS; FEATURES; SITES;
D O I
10.1371/journal.pone.0065587
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein-protein interactions (PPIs) are essential for understanding the function of biological systems and have been characterized using a vast array of experimental techniques. These techniques detect only a small proportion of all PPIs and are labor intensive and time consuming. Therefore, the development of computational methods capable of predicting PPIs accelerates the pace of discovery of new interactions. This paper reports a machine learning-based prediction model, the Universal In Silico Predictor of Protein-Protein Interactions (UNISPPI), which is a decision tree model that can reliably predict PPIs for all species (including proteins from parasite-host associations) using only 20 combinations of amino acids frequencies from interacting and non-interacting proteins as learning features. UNISPPI was able to correctly classify 79.4% and 72.6% of experimentally supported interactions and non-interacting protein pairs, respectively, from an independent test set. Moreover, UNISPPI suggests that the frequencies of the amino acids asparagine, cysteine and isoleucine are important features for distinguishing between interacting and non-interacting protein pairs. We envisage that UNISPPI can be a useful tool for prioritizing interactions for experimental validation.
引用
收藏
页数:11
相关论文
共 55 条
  • [51] Predicting protein-protein interactions in unbalanced data using the primary structure of proteins
    Yu, Chi-Yuan
    Chou, Lih-Ching
    Chang, Darby Tien-Hao
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [52] Simple sequence-based kernels do not predict protein-protein interactions
    Yu, Jiantao
    Guo, Maozu
    Needham, Chris J.
    Huang, Yangchao
    Cai, Lu
    Westhead, David R.
    [J]. BIOINFORMATICS, 2010, 26 (20) : 2610 - 2614
  • [53] Protein-protein interaction based on pairwise similarity
    Zaki, Nazar
    Lazarova-Molnar, Sanja
    El-Hajj, Wassim
    Campbell, Piers
    [J]. BMC BIOINFORMATICS, 2009, 10
  • [54] Predicting co-complexed protein pairs using genomic and proteomic data integration
    Zhang, LV
    Wong, SL
    King, OD
    Roth, FP
    [J]. BMC BIOINFORMATICS, 2004, 5 (1)
  • [55] Can simple codon pair usage predict protein-protein interaction?
    Zhou, Yuan
    Zhou, Ying-Si
    He, Fei
    Song, Jiangning
    Zhang, Ziding
    [J]. MOLECULAR BIOSYSTEMS, 2012, 8 (05) : 1396 - 1404