Inference of protein function from protein structure

被引:131
作者
Pal, D [1 ]
Eisenberg, D [1 ]
机构
[1] Univ Calif Los Angeles, DOE, Inst Genom & Prote, Los Angeles, CA 90095 USA
关键词
D O I
10.1016/j.str.2004.10.015
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Structural genomics has brought us three-dimensional structures of proteins with unknown functions. To shed light on such structures, we have developed ProKnow (http://www.doe-mbi.ucia.edu/ServicestProKnow), which annotates proteins with Gene Ontology functional terms. The method extracts features from the protein such as 3D fold, sequence, motif, and functional linkages and relates them to function via the ProKnow knowledgebase of features, which links features to annotated functions via annotation profiles. Bayes' theorem is used to compute weights of the functions assigned, using likelihoods based on the extracted features. The description level of the assigned function is quantified by the ontology depth (from 1 = general to 9 = specific). Jackknife tests show 89% correct assignments at ontology depth 1 and 40% at depth 9, with 93% coverage of 1507 distinct folded proteins. Overall, about 70% of the assignments were inferred correctly. This level of performance suggests that ProKnow is a useful resource in functional assessments of novel proteins.
引用
收藏
页码:121 / 130
页数:10
相关论文
共 40 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] [Anonymous], PROBABILITY
  • [3] Ashburner M, 2001, GENOME RES, V11, P1425
  • [4] PRINTS and its automatic supplement, prePRINTS
    Attwood, TK
    Bradley, P
    Flower, DR
    Gaulton, A
    Maudling, N
    Mitchell, AL
    Moulton, G
    Nordle, A
    Paine, K
    Taylor, P
    Uddin, A
    Zygouri, C
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 400 - 402
  • [5] The reporter's privilege, then and now
    Bates, S
    [J]. SOCIETY, 2001, 38 (05) : 41 - 54
  • [6] Bork P, 1996, METHOD ENZYMOL, V266, P162
  • [7] A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE
    BOWIE, JU
    LUTHY, R
    EISENBERG, D
    [J]. SCIENCE, 1991, 253 (5016) : 164 - 170
  • [8] Protein function classification via support vector machine approach
    Cai, CZ
    Wang, WL
    Sun, LZ
    Chen, YZ
    [J]. MATHEMATICAL BIOSCIENCES, 2003, 185 (02) : 111 - 122
  • [9] Devos D, 2000, PROTEINS, V41, P98, DOI 10.1002/1097-0134(20001001)41:1<98::AID-PROT120>3.0.CO
  • [10] 2-S