Global protein function annotation through mining genome-scale data in yeast Saccharomyces cerevisiae

被引:78
作者
Chen, Y
Xu, D [1 ]
机构
[1] UT ORNL, Grad Sch Genome Sci & Technol, Oak Ridge, TN 37830 USA
[2] Univ Missouri, Dept Comp Sci, Digital Biol Lab, Columbia, MO 65211 USA
关键词
D O I
10.1093/nar/gkh978
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
As we are moving into the post genome-sequencing era, various high-throughput experimental techniques have been developed to characterize biological systems on the genomic scale. Discovering new biological knowledge from the high-throughput biological data is a major challenge to bioinformatics today. To address this challenge, we developed a Bayesian statistical method together with Boltzmann machine and simulated annealing for protein functional annotation in the yeast Saccharomyces cerevisiae through integrating various high-throughput biological data, including yeast two-hybrid data, protein complexes and microarray gene expression profiles. In our approach, we quantified the relationship between functional similarity and high-throughput data, and coded the relationship into 'functional linkage graph', where each node represents one protein and the weight of each edge is characterized by the Bayesian probability of function similarity between two proteins. We also integrated the evolution information and protein subcellular localization information into the prediction. Based on our method, 1802 out of 2280 unannotated proteins in yeast were assigned functions systematically.
引用
收藏
页码:6414 / 6424
页数:11
相关论文
共 37 条
  • [1] ACKLEY DH, 1985, COGNITIVE SCI, V9, P147
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [4] Predicting protein complex membership using probabilistic network reliability
    Asthana, S
    King, OD
    Gibbons, FD
    Roth, FP
    [J]. GENOME RESEARCH, 2004, 14 (06) : 1170 - 1175
  • [5] Knowledge-based analysis of microarray gene expression data by using support vector machines
    Brown, MPS
    Grundy, WN
    Lin, D
    Cristianini, N
    Sugnet, CW
    Furey, TS
    Ares, M
    Haussler, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) : 262 - 267
  • [6] Computational analyses of high-throughput protein-protein interaction data
    Chen, Y
    Xu, D
    [J]. CURRENT PROTEIN & PEPTIDE SCIENCE, 2003, 4 (03) : 159 - 180
  • [7] CHEN Y, 2004, IN PRESS BIOINFORMAT
  • [8] THE 2-HYBRID SYSTEM - A METHOD TO IDENTIFY AND CLONE GENES FOR PROTEINS THAT INTERACT WITH A PROTEIN OF INTEREST
    CHIEN, CT
    BARTEL, PL
    STERNGLANZ, R
    FIELDS, S
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1991, 88 (21) : 9578 - 9582
  • [9] Predicting gene function in Saccharomyces cerevisiae
    Clare, A.
    King, R. D.
    [J]. BIOINFORMATICS, 2003, 19 : II42 - II49
  • [10] Inferring domain-domain interactions from protein-protein interactions
    Deng, MH
    Mehta, S
    Sun, FZ
    Chen, T
    [J]. GENOME RESEARCH, 2002, 12 (10) : 1540 - 1548