Identification and application of the concepts important for accurate and reliable protein secondary structure prediction

被引:378
作者
King, RD [1 ]
Sternberg, MJE [1 ]
机构
[1] IMPERIAL CANC RES FUND, BIOMOLEC MODELLING LAB, LONDON WC2A 3PX, ENGLAND
关键词
prediction; secondary structure; statistics;
D O I
10.1002/pro.5560051116
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A protein secondary structure prediction method from multiply aligned homologous sequences is presented with an overall per residue three-state accuracy of 70.1%. There are two aims: to obtain high accuracy by identification of a set of concepts important for prediction followed by use of linear statistics; and to provide insight into the folding process, The important concepts in secondary structure prediction are identified as: residue conformational propensities, sequence edge effects, moments of hydrophobicity, position of insertions and deletions in aligned homologous sequence, moments of conservation, auto-correlation, residue ratios, secondary structure feedback effects, and filtering. Explicit use of edge effects, moments of conservation, and auto-correlation are new to this paper, The relative importance of the concepts used in prediction was analyzed by stepwise addition of information and examination of weights in the discrimination function, The simple and explicit structure of the prediction allows the method to be reimplemented easily, The accuracy of a prediction is predictable a priori, This permits evaluation of the utility of the prediction: 10% of the chains predicted were identified correctly as having a mean accuracy of >80%. Existing high-accuracy prediction methods are ''black-box'' predictors based on complex nonlinear statistics (e.g., neural networks in PHD: Rost & Sander, 1993a). For medium- to short-length chains (greater than or equal to 90 residues and <170 residues), the prediction method is significantly more accurate (P < 0.01) than the PHD algorithm (probably the most commonly used algorithm), In combination with the PHD, an algorithm is formed that is significantly more accurate than either method, with an estimated overall three-state accuracy of 72.4%, the highest accuracy reported for any prediction method.
引用
收藏
页码:2298 / 2310
页数:13
相关论文
共 50 条
  • [1] PATTERNS OF DIVERGENCE IN HOMOLOGOUS PROTEINS AS INDICATORS OF SECONDARY AND TERTIARY STRUCTURE - A PREDICTION OF THE STRUCTURE OF THE CATALYTIC DOMAIN OF PROTEIN-KINASES
    BENNER, SA
    GERLOFF, D
    [J]. ADVANCES IN ENZYME REGULATION, 1991, 31 : 121 - 181
  • [2] PREDICTING THE CONFORMATION OF PROTEINS - MAN VERSUS MACHINE
    BENNER, SA
    GERLOFF, DL
    [J]. FEBS LETTERS, 1993, 325 (1-2): : 29 - 33
  • [3] CORRECT STRUCTURE PREDICTION
    BENNER, SA
    COHEN, MA
    GERLOFF, D
    [J]. NATURE, 1992, 359 (6398) : 781 - 781
  • [4] SECONDARY STRUCTURE PREDICTION - COMBINATION OF 3 DIFFERENT METHODS
    BIOU, V
    GIBRAT, JF
    LEVIN, JM
    ROBSON, B
    GARNIER, J
    [J]. PROTEIN ENGINEERING, 1988, 2 (03): : 185 - 191
  • [5] STRUCTURAL BASIS OF AMINO-ACID ALPHA-HELIX PROPENSITY
    BLABER, M
    ZHANG, XJ
    MATTHEWS, BW
    [J]. SCIENCE, 1993, 260 (5114) : 1637 - 1640
  • [6] SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation
    Blewitt, Marnie E.
    Gendrel, Anne-Valerie
    Pang, Zhenyi
    Sparrow, Duncan B.
    Whitelaw, Nadia
    Craig, Jeffrey M.
    Apedaile, Anwyn
    Hilton, Douglas J.
    Dunwoodie, Sally L.
    Brockdorff, Neil
    Kay, Graham F.
    Whitelaw, Emma
    [J]. NATURE GENETICS, 2008, 40 (05) : 663 - 669
  • [7] PROTEIN DESIGN - A HIERARCHICAL APPROACH
    BRYSON, JW
    BETZ, SF
    LU, HS
    SUICH, DJ
    ZHOU, HXX
    ONEIL, KT
    DEGRADO, WF
    [J]. SCIENCE, 1995, 270 (5238) : 935 - 941
  • [8] PREDICTION OF PROTEIN CONFORMATION
    CHOU, PY
    FASMAN, GD
    [J]. BIOCHEMISTRY, 1974, 13 (02) : 222 - 245
  • [9] SECONDARY STRUCTURE ASSIGNMENT FOR ALPHA-BETA-PROTEINS BY A COMBINATORIAL APPROACH
    COHEN, FE
    ABARBANEL, RM
    KUNTZ, ID
    FLETTERICK, RJ
    [J]. BIOCHEMISTRY, 1983, 22 (21) : 4894 - 4904
  • [10] COMPARISON OF 3 ALGORITHMS FOR THE ASSIGNMENT OF SECONDARY STRUCTURE IN PROTEINS - THE ADVANTAGES OF A CONSENSUS ASSIGNMENT
    COLLOCH, N
    ETCHEBEST, C
    THOREAU, E
    HENRISSAT, B
    MORNON, JP
    [J]. PROTEIN ENGINEERING, 1993, 6 (04): : 377 - 382