Predicting Protein Function by Frequent Functional Association Pattern Mining in Protein Interaction Networks

被引:45
作者
Cho, Young-Rae [1 ]
Zhang, Aidong [2 ]
机构
[1] Baylor Univ, Dept Comp Sci, Waco, TX 76798 USA
[2] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
来源
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE | 2010年 / 14卷 / 01期
关键词
Function prediction; protein-protein interactions; protein interaction networks; INTEGRATION; ANNOTATION; ALGORITHM;
D O I
10.1109/TITB.2009.2028234
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Predicting protein function from protein interaction networks has been challenging because of the complexity of functional relationships among proteins. Most previous function prediction methods depend on the neighborhood of or the connected paths to known proteins. However, their accuracy has been limited due to the functional inconsistency of interacting proteins. In this paper, we propose a novel approach for function prediction by identifying frequent patterns of functional associations in a protein interaction network. A set of functions that a protein performs is assigned into the corresponding node as a label. A functional association pattern is then represented as a labeled subgraph. Our frequent labeled subgraph mining algorithm efficiently searches the functional association patterns that occur frequently in the network. It iteratively increases the size of frequent patterns by one node at a time by selective joining, and simplifies the network by a priori pruning. Using the yeast protein interaction network, our algorithm found more than 1400 frequent functional association patterns. The function prediction is performed by matching the subgraph, including the unknown protein, with the frequent patterns analogous to it. By leave-one-out cross validation, we show that our approach has better performance than previous link-based methods in terms of prediction accuracy. The frequent functional association patterns generated in this study might become the foundations of advanced analysis for functional behaviors of proteins in a system level.
引用
收藏
页码:30 / 36
页数:7
相关论文
共 21 条
[1]  
Agrawal R., 1994, P 20 INT C VER LARG, P487, DOI DOI 10.5555/645920.672836
[2]   Semantic integration to identify overlapping functional modules in protein interaction networks [J].
Cho, Young-Rae ;
Hwang, Woochang ;
Ramanathan, Murali ;
Zhang, Aidong .
BMC BIOINFORMATICS, 2007, 8 (1)
[3]   A probabilistic framework to predict protein function from interaction data integrated with semantic knowledge [J].
Cho, Young-Rae ;
Shi, Lei ;
Ramanathan, Murali ;
Zhang, Aidong .
BMC BIOINFORMATICS, 2008, 9 (1)
[4]   Exploiting indirect neighbours and topological weight to predict protein function from protein-protein interactions [J].
Chua, Hon Nian ;
Sung, Wing-Kin ;
Wong, Limsoon .
BIOINFORMATICS, 2006, 22 (13) :1623-1630
[5]   Prediction of protein function using protein-protein interaction data [J].
Deng, MH ;
Zhang, K ;
Mehta, S ;
Chen, T ;
Sun, FZ .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2003, 10 (06) :947-960
[6]  
Hishigaki H, 2001, YEAST, V18, P523, DOI 10.1002/yea.706.abs
[7]  
Huan J, 2003, THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, P549
[8]   Whole-genome annotation by using evidence integration in functional-linkage networks [J].
Karaoz, U ;
Murali, TM ;
Letovsky, S ;
Zheng, Y ;
Ding, CM ;
Cantor, CR ;
Kasif, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (09) :2888-2893
[9]   Efficient sampling algorithm for estimating subgraph concentrations and detecting network motifs [J].
Kashtan, N ;
Itzkovitz, S ;
Milo, R ;
Alon, U .
BIOINFORMATICS, 2004, 20 (11) :1746-1758
[10]  
Kirac M, 2008, LECT N BIOINFORMAT, V4955, P197