Refined phylogenetic profiles method for predicting protein-protein interactions

被引:80
作者
Sun, JC
Xu, JL
Liu, Z
Liu, Q
Zhao, AM
Shi, TL [1 ]
Li, YX
机构
[1] Chinese Acad Sci, Shanghai Inst Biol Sci, Bioinformat Ctr, Shanghai 200031, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Life Sci & Technol, Shanghai 200240, Peoples R China
[3] Hunan Normal Univ, Dept Biol, Changsha 410081, Peoples R China
[4] Chinese Natl Ctr Biotechnol Dev, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1093/bioinformatics/bti532
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The increasing availability of complete genome sequences provides excellent opportunity for the further development of tools for functional studies in proteomics. Several experimental approaches and in silico algorithms have been developed to cluster proteins into networks of biological significance that may provide new biological insights, especially into understanding the functions of many uncharacterized proteins. Among these methods, the phylogenetic profiles method has been widely used to predict protein-protein interactions. It involves the selection of reference organisms and identification of homologous proteins. Up to now, no published report has systematically studied the effects of the reference genome selection and the identification of homologous proteins upon the accuracy of this method. Results: In this study, we optimized the phylogenetic profiles method by integrating phylogenetic relationships among reference organisms and sequence homology information to improve prediction accuracy. Our results revealed that the selection of the reference organisms set and the criteria for homology identification significantly are two critical factors for the prediction accuracy of this method. Our refined phylogenetic profiles method shows greater performance and potentially provides more reliable functional linkages compared with previous methods.
引用
收藏
页码:3409 / 3415
页数:7
相关论文
共 26 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
Auerbach D, 2002, PROTEOMICS, V2, P611, DOI 10.1002/1615-9861(200206)2:6<611::AID-PROT611>3.0.CO
[3]  
2-Y
[4]   Computational analyses of high-throughput protein-protein interaction data [J].
Chen, Y ;
Xu, D .
CURRENT PROTEIN & PEPTIDE SCIENCE, 2003, 4 (03) :159-180
[5]   Discovery of uncharacterized cellular systems by genome-wide analysis of functional linkages [J].
Date, SV ;
Marcotte, EM .
NATURE BIOTECHNOLOGY, 2003, 21 (09) :1055-1062
[6]   Protein function in the post-genomic era [J].
Eisenberg, D ;
Marcotte, EM ;
Xenarios, I ;
Yeates, TO .
NATURE, 2000, 405 (6788) :823-826
[7]   Annotation of bacterial genomes using improved phylogenomic profiles [J].
Enault, F. ;
Suhre, K. ;
Abergel, C. ;
Poirot, O. ;
Claverie, J. -M. .
BIOINFORMATICS, 2003, 19 :i105-i107
[8]   Protein interaction maps for complete genomes based on gene fusion events [J].
Enright, AJ ;
Iliopoulos, I ;
Kyrpides, NC ;
Ouzounis, CA .
NATURE, 1999, 402 (6757) :86-90
[9]   A NOVEL GENETIC SYSTEM TO DETECT PROTEIN PROTEIN INTERACTIONS [J].
FIELDS, S ;
SONG, OK .
NATURE, 1989, 340 (6230) :245-246
[10]  
Gaasterland T, 1998, Microb Comp Genomics, V3, P199