A KNOWLEDGE BASE FOR PREDICTING PROTEIN LOCALIZATION SITES IN EUKARYOTIC CELLS

被引:1331
作者
NAKAI, K [1 ]
KANEHISA, M [1 ]
机构
[1] KYOTO UNIV, INST CHEM RES, UJI, KYOTO 611, JAPAN
关键词
D O I
10.1016/S0888-7543(05)80111-9
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
To automate examination of massive amounts of sequence data for biological function, it is important to computerize interpretation based on empirical knowledge of sequence-function relationships. For this purpose, we have been constructing a knowledge base by organizing various experimental and computational observations as a collection of if-then rules. Here we report an expert system, which utilizes this knowledge base, for predicting localization sites of proteins only from the information on the amino acid sequence and the source origin. We collected data for 401 eukaryotic proteins with known localization sites (subcellular and extracellular) and divided them into training data and testing data. Fourteen localization sites were distinguished for animal cells and 17 for plant cells. When sorting signals were not well characterized experimentally, various sequence features were computationally derived from the training data. It was found that 66% of the training data and 59% of the testing data were correctly predicted by our expert system. This artificial intelligence approach is powerful and flexible enough to be used in genome analyses. © 1992 Academic Press, Inc. All rights reserved.
引用
收藏
页码:897 / 911
页数:15
相关论文
共 64 条
  • [1] SEQUENCE IDENTIFICATION OF 2,375 HUMAN BRAIN GENES
    ADAMS, MD
    DUBNICK, M
    KERLAVAGE, AR
    MORENO, R
    KELLEY, JM
    UTTERBACK, TR
    NAGLE, JW
    FIELDS, C
    VENTER, JC
    [J]. NATURE, 1992, 355 (6361) : 632 - 634
  • [2] MITOCHONDRIAL PROTEINS ESSENTIAL FOR VIABILITY MEDIATE PROTEIN IMPORT INTO YEAST MITOCHONDRIA
    BAKER, KP
    SCHATZ, G
    [J]. NATURE, 1991, 349 (6306) : 205 - 208
  • [3] GENERATION OF A LYSOSOMAL-ENZYME TARGETING SIGNAL IN THE SECRETORY PROTEIN PEPSINOGEN
    BARANSKI, TJ
    FAUST, PL
    KORNFELD, S
    [J]. CELL, 1990, 63 (02) : 281 - 291
  • [4] BARKER WC, 1990, METHOD ENZYMOL, V183, P31
  • [5] A COMMON PEPTIDE STRETCH AMONG ENZYMES LOCALIZED TO THE GOLGI-APPARATUS - STRUCTURAL SIMILARITY OF GOLGI-ASSOCIATED GLYCOSYLTRANSFERASES
    BENDIAK, B
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 1990, 170 (02) : 879 - 882
  • [7] CHEN WJ, 1990, J BIOL CHEM, V265, P3116
  • [8] SHORT PEPTIDE DOMAINS TARGET PROTEINS TO PLANT VACUOLES
    CHRISPEELS, MJ
    RAIKHEL, NV
    [J]. CELL, 1992, 68 (04) : 613 - 616
  • [9] TRANSFERRIN RECEPTOR INTERNALIZATION SEQUENCE YXRF IMPLICATES A TIGHT TURN AS THE STRUCTURAL RECOGNITION MOTIF FOR ENDOCYTOSIS
    COLLAWN, JF
    STANGEL, M
    KUHN, LA
    ESEKOGWU, V
    JING, SQ
    TROWBRIDGE, IS
    TAINER, JA
    [J]. CELL, 1990, 63 (05) : 1061 - 1072
  • [10] GLYCOLIPID ANCHORING OF PLASMA-MEMBRANE PROTEINS
    CROSS, GAM
    [J]. ANNUAL REVIEW OF CELL BIOLOGY, 1990, 6 : 1 - 39