Refined phylogenetic profiles method for predicting protein-protein interactions

被引:80
作者
Sun, JC
Xu, JL
Liu, Z
Liu, Q
Zhao, AM
Shi, TL [1 ]
Li, YX
机构
[1] Chinese Acad Sci, Shanghai Inst Biol Sci, Bioinformat Ctr, Shanghai 200031, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Life Sci & Technol, Shanghai 200240, Peoples R China
[3] Hunan Normal Univ, Dept Biol, Changsha 410081, Peoples R China
[4] Chinese Natl Ctr Biotechnol Dev, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1093/bioinformatics/bti532
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The increasing availability of complete genome sequences provides excellent opportunity for the further development of tools for functional studies in proteomics. Several experimental approaches and in silico algorithms have been developed to cluster proteins into networks of biological significance that may provide new biological insights, especially into understanding the functions of many uncharacterized proteins. Among these methods, the phylogenetic profiles method has been widely used to predict protein-protein interactions. It involves the selection of reference organisms and identification of homologous proteins. Up to now, no published report has systematically studied the effects of the reference genome selection and the identification of homologous proteins upon the accuracy of this method. Results: In this study, we optimized the phylogenetic profiles method by integrating phylogenetic relationships among reference organisms and sequence homology information to improve prediction accuracy. Our results revealed that the selection of the reference organisms set and the criteria for homology identification significantly are two critical factors for the prediction accuracy of this method. Our refined phylogenetic profiles method shows greater performance and potentially provides more reliable functional linkages compared with previous methods.
引用
收藏
页码:3409 / 3415
页数:7
相关论文
共 26 条
[11]   The KEGG resource for deciphering the genome [J].
Kanehisa, M ;
Goto, S ;
Kawashima, S ;
Okuno, Y ;
Hattori, M .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D277-D280
[12]   EcoCyc:: a comprehensive database resource for Escherichia coli [J].
Keseler, IM ;
Collado-Vides, J ;
Gama-Castro, S ;
Ingraham, J ;
Paley, S ;
Paulsen, IT ;
Peralta-Gill, M ;
Karp, PD .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D334-D337
[13]   Localizing proteins in the cell from their phylogenetic profiles [J].
Marcotte, EM ;
Xenarios, I ;
van der Bliek, AM ;
Eisenberg, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (22) :12115-12120
[14]   A combined algorithm for genome-wide prediction of protein function [J].
Marcotte, EM ;
Pellegrini, M ;
Thompson, MJ ;
Yeates, TO ;
Eisenberg, D .
NATURE, 1999, 402 (6757) :83-86
[15]   Detecting protein function and protein-protein interactions from genome sequences [J].
Marcotte, EM ;
Pellegrini, M ;
Ng, HL ;
Rice, DW ;
Yeates, TO ;
Eisenberg, D .
SCIENCE, 1999, 285 (5428) :751-753
[16]   Guilt-by-association goes global [J].
Oliver, S .
NATURE, 2000, 403 (6770) :601-603
[17]   The use of gene clusters to infer functional coupling [J].
Overbeek, R ;
Fonstein, M ;
D'Souza, M ;
Pusch, GD ;
Maltsev, N .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (06) :2896-2901
[18]   Assigning protein functions by comparative genome analysis: Protein phylogenetic profiles [J].
Pellegrini, M ;
Marcotte, EM ;
Thompson, MJ ;
Eisenberg, D ;
Yeates, TO .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (08) :4285-4288
[19]   The Database of Interacting Proteins: 2004 update [J].
Salwinski, L ;
Miller, CS ;
Smith, AJ ;
Pettit, FK ;
Bowie, JU ;
Eisenberg, D .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D449-D451
[20]   A network of protein-protein interactions in yeast [J].
Schwikowski, B ;
Uetz, P ;
Fields, S .
NATURE BIOTECHNOLOGY, 2000, 18 (12) :1257-1261