Prediction of protein-protein interactions using random decision forest framework

被引:298
作者
Chen, XW [1 ]
Liu, M [1 ]
机构
[1] Univ Kansas, Bioinformat & Computat Life Sci Lab, ITTC, Dept Elect Engn & Comp Sci, Lawrence, KS 66045 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
D O I
10.1093/bioinformatics/bti721
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Protein interactions are of biological interest because they orchestrate a number of cellular processes such as metabolic pathways and immunological recognition. Domains are the building blocks of proteins; therefore, proteins are assumed to interact as a result of their interacting domains. Many domain-based models for protein interaction prediction have been developed, and preliminary results have demonstrated their feasibility. Most of the existing domain-based methods, however, consider only single-domain pairs (one domain from one protein) and assume independence between domain-domain interactions. Results: In this paper, we introduce a domain-based random forest of decision trees to infer protein interactions. Our proposed method is capable of exploring all possible domain interactions and making predictions based on all the protein domains. Experimental results on Saccharomyces cerevisiae dataset demonstrate that our approach can predict protein-protein interactions with higher sensitivity (79.78%) and specificity (64.38%) compared with that of the maximum likelihood approach. Furthermore, our model can be used to infer interactions not only for single-domain pairs but also for multiple domain pairs.
引用
收藏
页码:4394 / 4400
页数:7
相关论文
共 37 条
[1]   Hsk1-Dfp1 is required for heterochromatin-mediated cohesion at centromeres [J].
Bailis, JM ;
Bernard, P ;
Antonelli, R ;
Allshire, RC ;
Forsburg, SL .
NATURE CELL BIOLOGY, 2003, 5 (12) :1111-1116
[2]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkr1065, 10.1093/nar/gkh121]
[3]   Predicting protein-protein interactions from primary structure [J].
Bock, JR ;
Gough, DA .
BIOINFORMATICS, 2001, 17 (05) :455-460
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   Conservation of gene order: a fingerprint of proteins that physically interact [J].
Dandekar, T ;
Snel, B ;
Huynen, M ;
Bork, P .
TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (09) :324-328
[6]   Inferring domain-domain interactions from protein-protein interactions [J].
Deng, MH ;
Mehta, S ;
Sun, FZ ;
Chen, T .
GENOME RESEARCH, 2002, 12 (10) :1540-1548
[7]   Protein interaction maps for complete genomes based on gene fusion events [J].
Enright, AJ ;
Iliopoulos, I ;
Kyrpides, NC ;
Ouzounis, CA .
NATURE, 1999, 402 (6757) :86-90
[8]   Prediction of protein-protein interaction sites in heterocomplexes with neural networks [J].
Fariselli, P ;
Pazos, F ;
Valencia, A ;
Casadio, R .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 2002, 269 (05) :1356-1361
[9]   iPfam:: visualization of protein-protein interactions in PDB at domain and amino acid resolutions [J].
Finn, RD ;
Marshall, M ;
Bateman, A .
BIOINFORMATICS, 2005, 21 (03) :410-412
[10]   Co-evolution of proteins with their interaction partners [J].
Goh, CS ;
Bogan, AA ;
Joachimiak, M ;
Walther, D ;
Cohen, FE .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 299 (02) :283-293