CONSERVATION AND PREDICTION OF SOLVENT ACCESSIBILITY IN PROTEIN FAMILIES

被引:512
作者
ROST, B
SANDER, C
机构
[1] Protein Design Group, European Molecular Biology Laboratory, Heidelberg
关键词
EVOLUTIONARY INFORMATION; MULTIPLE ALIGNMENTS; NEURAL NETWORKS; PROTEIN STRUCTURE PREDICTION;
D O I
10.1002/prot.340200303
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Currently, the prediction of three-dimensional (3D) protein structure from sequence alone is an exceedingly difficult task, As an intermediate step, a much simpler task has been pursued extensively: predicting 1D strings of secondary structure. Here, we present an analysis of another 1D projection from 3D structure: the relative solvent accessibility of each residue. We show that solvent accessibility is less conserved in 3D homologues than is secondary structure, and hence is predicted less accurately from automatic homology modeling; the correlation coefficient of relative solvent accessibility between 3D homologues is only 0.77, and the average accuracy of predictions based on sequence alignments is only 0.68, The latter number provides an effective upper limit on the accuracy of predicting accessibility from sequence when homology modeling is not possible, We introduce a neural network system that predicts relative solvent accessibility (projected onto ten discrete states) using evolutionary profiles of amino acid substitutions derived from multiple sequence alignments, Evaluated in a cross-validation test on 238 unique proteins, the correlation between predicted and observed relative accessibility is 0.54, Interpreted in terms of a three-state (buried, intermediate, exposed) description of relative accessibility, the fraction of correctly predicted residue states is about 58%, In absolute terms this accuracy appears poor, but given the relatively low conservation of accessibility in 3D families, the network system is not far from its likely optimal performance. The most reliably predicted fraction of the residues (50%) is predicted as accurately as by automatic homology modeling, Prediction is best for buried residues, e.g., 86% of the completely buried sites are correctly predicted as having 0% relative accessibility. (C) 1994 Wiley-Liss, Inc.
引用
收藏
页码:216 / 226
页数:11
相关论文
共 88 条
  • [1] KINETICS OF FORMATION OF NATIVE RIBONUCLEASE DURING OXIDATION OF REDUCED POLYPEPTIDE CHAIN
    ANFINSEN, CB
    HABER, E
    SELA, M
    WHITE, FH
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1961, 47 (09) : 1309 - +
  • [2] PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS
    ANFINSEN, CB
    [J]. SCIENCE, 1973, 181 (4096) : 223 - 230
  • [3] THE SWISS-PROT PROTEIN-SEQUENCE DATA-BANK
    BAIROCH, A
    BOECKMANN, B
    [J]. NUCLEIC ACIDS RESEARCH, 1992, 20 : 2019 - 2022
  • [4] POLARITY AS A CRITERION IN PROTEIN DESIGN
    BAUMANN, G
    FROMMEL, C
    SANDER, C
    [J]. PROTEIN ENGINEERING, 1989, 2 (05): : 329 - 334
  • [5] PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES
    BERNSTEIN, FC
    KOETZLE, TF
    WILLIAMS, GJB
    MEYER, EF
    BRICE, MD
    RODGERS, JR
    KENNARD, O
    SHIMANOUCHI, T
    TASUMI, M
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) : 535 - 542
  • [6] 18TH KREBS,HANS LECTURE - KNOWLEDGE-BASED PROTEIN MODELING AND DESIGN
    BLUNDELL, T
    CARNEY, D
    GARDNER, S
    HAYES, F
    HOWLIN, B
    HUBBARD, T
    OVERINGTON, J
    SINGH, DA
    SIBANDA, BL
    SUTCLIFFE, M
    [J]. EUROPEAN JOURNAL OF BIOCHEMISTRY, 1988, 172 (03): : 513 - 520
  • [7] KNOWLEDGE-BASED PREDICTION OF PROTEIN STRUCTURES AND THE DESIGN OF NOVEL MOLECULES
    BLUNDELL, TL
    SIBANDA, BL
    STERNBERG, MJE
    THORNTON, JM
    [J]. NATURE, 1987, 326 (6111) : 347 - 352
  • [8] A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE
    BOWIE, JU
    LUTHY, R
    EISENBERG, D
    [J]. SCIENCE, 1991, 253 (5016) : 164 - 170
  • [9] ASSESSING THE QUALITY OF SOLUTION NUCLEAR-MAGNETIC-RESONANCE STRUCTURES BY COMPLETE CROSS-VALIDATION
    BRUNGER, AT
    CLORE, GM
    GRONENBORN, AM
    SAFFRICH, R
    NILGES, M
    [J]. SCIENCE, 1993, 261 (5119) : 328 - 331
  • [10] COMPUTATIONAL CHALLENGES FOR MACROMOLECULAR STRUCTURE DETERMINATION BY X-RAY CRYSTALLOGRAPHY AND SOLUTION NMR-SPECTROSCOPY
    BRUNGER, AT
    NILGES, M
    [J]. QUARTERLY REVIEWS OF BIOPHYSICS, 1993, 26 (01) : 49 - 125