Prediction of protein function using protein-protein interaction data

被引:182
作者
Deng, MH [1 ]
Zhang, K [1 ]
Mehta, S [1 ]
Chen, T [1 ]
Sun, FZ [1 ]
机构
[1] Univ So Calif, Dept Biol Sci, Program Mol & Computat Biol, Los Angeles, CA 90089 USA
关键词
protein-protein interaction; pretein function; Markov random field; Bayesian method; Gibbs sampler;
D O I
10.1089/106652703322756168
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Assigning functions to novel proteins is one of the most important problems in the postgenomic era. Several approaches have been applied to this problem, including the analysis of gene expression patterns, phylogenetic profiles, protein fusions, and protein-protein interactions. In this paper, we develop a novel approach that employs the theory of Markov random fields to infer a protein's functions using protein-protein interaction data and the functional annotations of protein's interaction partners. For each function of interest and protein, we predict the probability that the protein has such function using Bayesian approaches. Unlike other available approaches for protein annotation in which a protein has or does not have a function of interest, we give a probability for having the function. This probability indicates how confident we are about the prediction. We employ our method to predict protein functions based on "biochemical function," "subcellular location," and "cellular role" for yeast proteins defined in the Yeast Proteome Database (YPD, www.incyte.com), using the protein-protein interaction data from the Munich Information Center for Protein Sequences (MIPS, mips.gsf.de). We show that our approach outperforms other available methods for function prediction based on protein interaction data. The supplementary data is available at www-hto.usc.edu/similar tomsms/ProteinFunction.
引用
收藏
页码:947 / 960
页数:14
相关论文
共 26 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] Knowledge-based analysis of microarray gene expression data by using support vector machines
    Brown, MPS
    Grundy, WN
    Lin, D
    Cristianini, N
    Sugnet, CW
    Furey, TS
    Ares, M
    Haussler, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) : 262 - 267
  • [3] Machine learning of functional class from phenotype data
    Clare, A
    King, RD
    [J]. BIOINFORMATICS, 2002, 18 (01) : 160 - 166
  • [4] YPD™, PombePD™ and WormPD™:: model organism volumes of the BioKnowledge™ Library, an integrated resource for protein information
    Costanzo, MC
    Crawford, ME
    Hirschman, JE
    Kranz, JE
    Olsen, P
    Robertson, LS
    Skrzypek, MS
    Braun, BR
    Hopkins, KL
    Kondu, P
    Lengieza, C
    Lew-Smith, JE
    Tillberg, M
    Garrels, JI
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 75 - 79
  • [5] First InP/InGaAs PNPHBT grown by metal organic chemical vapor deposition
    Cui, DL
    Hsu, S
    Pavlidis, D
    [J]. 2001 INTERNATIONAL CONFERENCE ON INDIUM PHOSPHIDE AND RELATED MATERIALS, CONFERENCE PROCEEDINGS, 2001, : 224 - 227
  • [6] Protein interactions - Two methods for assessment of the reliability of high throughput observations
    Deane, CM
    Salwinski, L
    Xenarios, I
    Eisenberg, D
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (05) : 349 - 356
  • [7] Deng Minghua, 2003, Pac Symp Biocomput, P140
  • [8] Cluster analysis and display of genome-wide expression patterns
    Eisen, MB
    Spellman, PT
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868
  • [9] Fellenberg M, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P152
  • [10] Functional organization of the yeast proteome by systematic analysis of protein complexes
    Gavin, AC
    Bösche, M
    Krause, R
    Grandi, P
    Marzioch, M
    Bauer, A
    Schultz, J
    Rick, JM
    Michon, AM
    Cruciat, CM
    Remor, M
    Höfert, C
    Schelder, M
    Brajenovic, M
    Ruffner, H
    Merino, A
    Klein, K
    Hudak, M
    Dickson, D
    Rudi, T
    Gnau, V
    Bauch, A
    Bastuck, S
    Huhse, B
    Leutwein, C
    Heurtier, MA
    Copley, RR
    Edelmann, A
    Querfurth, E
    Rybin, V
    Drewes, G
    Raida, M
    Bouwmeester, T
    Bork, P
    Seraphin, B
    Kuster, B
    Neubauer, G
    Superti-Furga, G
    [J]. NATURE, 2002, 415 (6868) : 141 - 147