Highly Efficient Framework for Predicting Interactions Between Proteins

被引:84
作者
You, Zhu-Hong [1 ]
Zhou, MengChu [2 ,3 ]
Luo, Xin [4 ]
Li, Shuai [5 ]
机构
[1] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[2] Macau Univ Sci & Technol, Inst Syst Engn, Macau 999078, Peoples R China
[3] New Jersey Inst Technol, Dept Elect & Comp Engn, Newark, NJ 07102 USA
[4] Chinese Acad Sci, Chongqing Inst Green & Intelligent Technol, Chongqing 400714, Peoples R China
[5] Hong Kong Polytech Univ, Dept Comp, Hong Kong 999077, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Big data; feature extraction; kernel extreme learning machine (K-ELM); low-rank approximation (LRA); protein-protein interactions (PPIs); support vector machine (SVM); EXTREME LEARNING-MACHINE; AMINO-ACID-COMPOSITION; NEURAL-NETWORKS; MODEL; CLASSIFICATION; SYSTEMS; PROTEOME; OPTIMIZATION; COMPLEXES; SEQUENCES;
D O I
10.1109/TCYB.2016.2524994
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Protein-protein interactions (PPIs) play a central role in many biological processes. Although a large amount of human PPI data has been generated by high-throughput experimental techniques, they are very limited compared to the estimated 130 000 protein interactions in humans. Hence, automatic methods for human PPI-detection are highly desired. This work proposes a novel framework, i. e., Low-rank approximationkernel Extreme Learning Machine (LELM), for detecting human PPI from a protein's primary sequences automatically. It has three main steps: 1) mapping each protein sequence into a matrix built on all kinds of adjacent amino acids; 2) applying the low-rank approximation model to the obtained matrix to solve its lowest rank representation, which reflects its true subspace structures; and 3) utilizing a powerful kernel extreme learning machine to predict the probability for PPI based on this lowest rank representation. Experimental results on a large-scale human PPI dataset demonstrate that the proposed LELM has significant advantages in accuracy and efficiency over the state-of-art approaches. Hence, this work establishes a new and effective way for the automatic detection of PPI.
引用
收藏
页码:731 / 743
页数:13
相关论文
共 78 条
[1]   Sparse Extreme Learning Machine for Classification [J].
Bai, Zuo ;
Huang, Guang-Bin ;
Wang, Danwei ;
Wang, Han ;
Westover, M. Brandon .
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (10) :1858-1870
[2]   Choosing negative examples for the prediction of protein-protein interactions [J].
Ben-Hur, A ;
Noble, WS .
BMC BIOINFORMATICS, 2006, 7 (Suppl 1)
[3]   Robust Subspace Segmentation Via Low-Rank Representation [J].
Chen, Jinhui ;
Yang, Jian .
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (08) :1432-1445
[4]   In Silico Prediction of Human Protein Interactions Using Fuzzy-SVM Mixture Models and Its Application to Cancer Research [J].
Chiang, Jung-Hsien ;
Lee, Tsung-Lu Michael .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2008, 16 (04) :1087-1095
[5]   Predicting Protein Function by Frequent Functional Association Pattern Mining in Protein Interaction Networks [J].
Cho, Young-Rae ;
Zhang, Aidong .
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2010, 14 (01) :30-36
[6]   Stability Analysis of Switched Fuzzy Systems Via Model Checking [J].
Ding, Zuohua ;
Zhou, Yuan ;
Zhou, MengChu .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (06) :1503-1514
[7]   An Accurate de novo Algorithm for Glycan Topology Determination from Mass Spectra [J].
Dong, Liang ;
Shi, Bing ;
Tian, Guangdong ;
Li, YanBo ;
Wang, Bing ;
Zhou, MengChu .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (03) :568-578
[8]   Gaussian Classifier-Based Evolutionary Strategy for Multimodal Optimization [J].
Dong, Wenyong ;
Zhou, MengChu .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (06) :1200-1216
[9]   Functional organization of the yeast proteome by systematic analysis of protein complexes [J].
Gavin, AC ;
Bösche, M ;
Krause, R ;
Grandi, P ;
Marzioch, M ;
Bauer, A ;
Schultz, J ;
Rick, JM ;
Michon, AM ;
Cruciat, CM ;
Remor, M ;
Höfert, C ;
Schelder, M ;
Brajenovic, M ;
Ruffner, H ;
Merino, A ;
Klein, K ;
Hudak, M ;
Dickson, D ;
Rudi, T ;
Gnau, V ;
Bauch, A ;
Bastuck, S ;
Huhse, B ;
Leutwein, C ;
Heurtier, MA ;
Copley, RR ;
Edelmann, A ;
Querfurth, E ;
Rybin, V ;
Drewes, G ;
Raida, M ;
Bouwmeester, T ;
Bork, P ;
Seraphin, B ;
Kuster, B ;
Neubauer, G ;
Superti-Furga, G .
NATURE, 2002, 415 (6868) :141-147
[10]   Predicting DNA Motifs by Using Evolutionary Multiobjective Optimization [J].
Gonzalez-Alvarez, David L. ;
Vega-Rodriguez, Miguel A. ;
Gomez-Pulido, Juan A. ;
Sanchez-Perez, Juan M. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06) :913-925