PREDICTION OF PROTEIN FOLDING CLASS FROM AMINO-ACID-COMPOSITION

被引:69
作者
DUBCHAK, I
HOLBROOK, SR
KIM, SH
机构
[1] LAWRENCE BERKELEY LAB,DEPT CHEM,BERKELEY,CA 94720
[2] LAWRENCE BERKELEY LAB,DIV STRUCT BIOL,BERKELEY,CA 94720
来源
PROTEINS-STRUCTURE FUNCTION AND GENETICS | 1993年 / 16卷 / 01期
关键词
PROTEIN STRUCTURE PREDICTION; NEURAL NETWORKS; AMINO ACID COMPOSITION; PROTEIN FOLDING CLASSES; 4-ALPHA-HELICAL BUNDLES; PARALLEL (ALPHA/BETA)8 BARRELS; NUCLEOTIDE BINDING FOLD; IMMUNOGLOBULIN FOLD;
D O I
10.1002/prot.340160109
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
An empirical relation between the amino acid composition and three-dimensional folding pattern of several classes of proteins has been determined. Computer simulated neural networks have been used to assign proteins to one of the following classes based on their amino acid composition and size: (1) 4alpha-helical bundles, (2) parallel (alpha/beta)8 barrels, (3) nucleotide binding fold, (4) immunoglobulin fold, or (5) none of these. Networks trained on the known crystal structures as well as sequences of closely related proteins are shown to correctly predict folding classes of proteins not represented in the training set with an average accuracy of 87%. Other folding motifs can easily be added to the prediction scheme once larger databases become available. Analysis of the neural network weights reveals that amino acids favoring prediction of a folding class are usually over represented in that class and amino acids with unfavorable weights are underrepresented in composition. The neural networks utilize combinations of these multiple small variations in amino acid composition in order to make a prediction. The favorably weighted amino acids in a given class also form the most intramolecular interactions with other residues in proteins of that class. A detailed examination of the contacts of these amino acids reveals some general patterns that may help stabilize each folding class.
引用
收藏
页码:79 / 91
页数:13
相关论文
共 39 条
  • [1] 3-DIMENSIONAL STRUCTURE OF A GENETICALLY ENGINEERED VARIANT OF PORCINE GROWTH-HORMONE
    ABDELMEGUID, SS
    SHIEH, HS
    SMITH, WW
    DAYRINGER, HE
    VIOLAND, BN
    BENTLE, LA
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (18) : 6434 - 6437
  • [2] THE SWISS-PROT PROTEIN-SEQUENCE DATA-BANK
    BAIROCH, A
    BOECKMANN, B
    [J]. NUCLEIC ACIDS RESEARCH, 1991, 19 : 2247 - 2248
  • [3] STRUCTURE OF THE CO1E1 ROP PROTEIN AT 1.7 A RESOLUTION
    BANNER, DW
    KOKKINIDIS, M
    TSERNOGLOU, D
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1987, 196 (03) : 657 - 675
  • [4] BENGIO Y, 1990, COMPUT APPL BIOSCI, V6, P319
  • [5] PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES
    BERNSTEIN, FC
    KOETZLE, TF
    WILLIAMS, GJB
    MEYER, EF
    BRICE, MD
    RODGERS, JR
    KENNARD, O
    SHIMANOUCHI, T
    TASUMI, M
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) : 535 - 542
  • [6] BLOOMER AC, 1978, NATURE, V276, P361
  • [7] A NOVEL-APPROACH TO PREDICTION OF THE 3-DIMENSIONAL STRUCTURES OF PROTEIN BACKBONES BY NEURAL NETWORKS
    BOHR, H
    BOHR, J
    BRUNAK, S
    COTTERILL, RMJ
    FREDHOLM, H
    LAUTRUP, B
    PETERSEN, SB
    [J]. FEBS LETTERS, 1990, 261 (01) : 43 - 46
  • [8] 3-DIMENSIONAL STRUCTURE OF INTERLEUKIN-2
    BRANDHUBER, BJ
    BOONE, T
    KENNEY, WC
    MCKAY, DB
    [J]. SCIENCE, 1987, 238 (4834) : 1707 - 1709
  • [9] PREDICTION OF PROTEIN CONFORMATION
    CHOU, PY
    FASMAN, GD
    [J]. BIOCHEMISTRY, 1974, 13 (02) : 222 - 245
  • [10] CHOU PY, 1989, PREDICTION PROTEIN S, P549