DISCRIMINATION OF INTRACELLULAR AND EXTRACELLULAR PROTEINS USING AMINO-ACID-COMPOSITION AND RESIDUE-PAIR FREQUENCIES

被引:351
作者
NAKASHIMA, H [1 ]
NISHIKAWA, K [1 ]
机构
[1] PROT ENGN RES INST,SUITA,OSAKA 565,JAPAN
关键词
INTRACELLULAR AND EXTRACELLULAR PROTEINS; RESIDUE-PAIR; COMPOSITIONS; DISCRIMINANT ANALYSIS;
D O I
10.1006/jmbi.1994.1267
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Sequences of intracellular and extracellular soluble proteins were analyzed statistically in terms of amino acid composition and residue-pair frequencies. Residue-pair frequencies were calculated for sequential separations from (n, n + 1) to (n, n + 5), and converted into scoring parameters. Then, for each test protein, the single-residue and residue-pair parameters were applied to calculate a total score. According to our definition, a protein which yields a positive score is indicative of an intracellular protein, whereas a negative score implies an extracellular one. The parameter set was derived from 894 sequences constituting different protein families in the PIR database, and assessed by application to a test of 379 proteins. The results showed that 88% of intracellular and 84% of extracellular proteins were correctly assigned. The discrimination power was improved by about 8% in comparison with the previous study, which used composition data alone. Segregation of intra/ extracellular proteins is also observed by other criteria, such as structural class (intracellular proteins prefer α and α/α types and extracellular proteins prefer α and α+α types). Segregation by sequence was found to be a more reliable procedure for distinguishing intra/ extracellular proteins than methods using structural class. Possible causes for this segregation by sequence are discussed. © 1994 Academic Press Limited.
引用
收藏
页码:54 / 61
页数:8
相关论文
共 13 条
  • [1] BARKER WC, 1990, METHOD ENZYMOL, V183, P31
  • [2] PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES
    BERNSTEIN, FC
    KOETZLE, TF
    WILLIAMS, GJB
    MEYER, EF
    BRICE, MD
    RODGERS, JR
    KENNARD, O
    SHIMANOUCHI, T
    TASUMI, M
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) : 535 - 542
  • [3] CONSIDERATION OF POSSIBILITY THAT SLOW STEP IN PROTEIN DENATURATION REACTIONS IS DUE TO CIS-TRANS ISOMERISM OF PROLINE RESIDUES
    BRANDTS, JF
    HALVORSON, HR
    BRENNAN, M
    [J]. BIOCHEMISTRY, 1975, 14 (22) : 4953 - 4963
  • [4] THE DETECTION AND CLASSIFICATION OF MEMBRANE-SPANNING PROTEINS
    KLEIN, P
    KANEHISA, M
    DELISI, C
    [J]. BIOCHIMICA ET BIOPHYSICA ACTA, 1985, 815 (03) : 468 - 476
  • [5] STRUCTURAL PATTERNS IN GLOBULAR PROTEINS
    LEVITT, M
    CHOTHIA, C
    [J]. NATURE, 1976, 261 (5561) : 552 - 558
  • [6] THE FOLDING TYPE OF A PROTEIN IS RELEVANT TO THE AMINO-ACID-COMPOSITION
    NAKASHIMA, H
    NISHIKAWA, K
    OOI, T
    [J]. JOURNAL OF BIOCHEMISTRY, 1986, 99 (01) : 153 - 162
  • [7] THE AMINO-ACID-COMPOSITION IS DIFFERENT BETWEEN THE CYTOPLASMIC AND EXTRACELLULAR SIDES IN MEMBRANE-PROTEINS
    NAKASHIMA, H
    NISHIKAWA, K
    [J]. FEBS LETTERS, 1992, 303 (2-3) : 141 - 146
  • [8] METHOD FOR CLUSTERING PROTEINS BY USE OF ALL POSSIBLE PAIRS OF AMINO-ACIDS AS STRUCTURAL DESCRIPTORS
    NAKAYAMA, SI
    SHIGEZUMI, S
    YOSHIDA, M
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1988, 28 (02): : 72 - 78
  • [9] CLASSIFICATION OF PROTEINS INTO GROUPS BASED ON AMINO-ACID-COMPOSITION AND OTHER CHARACTERS .2. GROUPING INTO 4 TYPES
    NISHIKAWA, K
    KUBOTA, Y
    OOI, T
    [J]. JOURNAL OF BIOCHEMISTRY, 1983, 94 (03) : 997 - 1007
  • [10] PRINCE RC, 1993, TRENDS BIOCHEM SCI, V18, P153