REDEFINING THE GOALS OF PROTEIN SECONDARY STRUCTURE PREDICTION

被引:274
作者
ROST, B
SANDER, C
SCHNEIDER, R
机构
[1] EMBL Heidelberg Meyerhofstraße 1
关键词
SECONDARY STRUCTURE PREDICTION; PREDICTION ACCURACY; SECONDARY STRUCTURE SEGMENTS; EVALUATION; HOMOLOGOUS PROTEINS;
D O I
10.1016/S0022-2836(05)80007-5
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Secondary structure prediction recently has surpassed the 70% level of average accuracy, evaluated on the single residue states helix, strand and loop (Q3). But the ultimate goal is reliable prediction of tertiary (three-dimensional, 3D) structure, not 100% single residue accuracy for secondary structure. A comparison of pairs of structurally homologous proteins with divergent sequences reveals that considerable variation in the position and length of secondary structure segments can be accommodated within the same 3D fold. It is therefore sufficient to predict the approximate location of helix, strand, turn and loop segments, provided they are compatible with the formation of 3D structure. Accordingly, we define here a measure of segment overlap (Sov) that is somewhat insensitive to small variations in secondary structure assignments. The new segment overlap measure ranges from an ignorance level of 37% (random protein pairs) via a current level of 72% for a prediction method based on sequence profile input to neural networks (PHD) to an average 90% level for homologous protein pairs. We conclude that the highest scores one can reasonably expect for secondary structure prediction are a single residue accuracy of Q3 > 85% and a fractional segment overlap of Sov > 90%. © 1994 Academic Press Limited.
引用
收藏
页码:13 / 26
页数:14
相关论文
共 89 条
[21]   ANALYSIS OF ACCURACY AND IMPLICATIONS OF SIMPLE METHODS FOR PREDICTING SECONDARY STRUCTURE OF GLOBULAR PROTEINS [J].
GARNIER, J ;
OSGUTHORPE, DJ ;
ROBSON, B .
JOURNAL OF MOLECULAR BIOLOGY, 1978, 120 (01) :97-120
[22]  
GASCUEL O, 1988, COMPUT APPL BIOSCI, V4, P357
[23]   THE NITROGENASE MOFE PROTEIN - A SECONDARY STRUCTURE PREDICTION [J].
GERLOFF, DL ;
JENNY, TF ;
KNECHT, LJ ;
GONNET, GH ;
BENNER, SA .
FEBS LETTERS, 1993, 318 (02) :118-124
[24]   FURTHER DEVELOPMENTS OF PROTEIN SECONDARY STRUCTURE PREDICTION USING INFORMATION-THEORY - NEW PARAMETERS AND CONSIDERATION OF RESIDUE PAIRS [J].
GIBRAT, JF ;
GARNIER, J ;
ROBSON, B .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 198 (03) :425-443
[25]  
HOBOHM U, 1992, PROTEIN SCI, V1, P409
[26]   PROTEIN SECONDARY STRUCTURE PREDICTION WITH A NEURAL NETWORK [J].
HOLLEY, LH ;
KARPLUS, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1989, 86 (01) :152-156
[27]   FAST AND SIMPLE MONTE-CARLO ALGORITHM FOR SIDE-CHAIN OPTIMIZATION IN PROTEINS - APPLICATION TO MODEL-BUILDING BY HOMOLOGY [J].
HOLM, L ;
SANDER, C .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1992, 14 (02) :213-223
[28]   A DATABASE OF PROTEIN-STRUCTURE FAMILIES WITH COMMON FOLDING MOTIFS [J].
HOLM, L ;
OUZOUNIS, C ;
SANDER, C ;
TUPAREV, G ;
VRIEND, G .
PROTEIN SCIENCE, 1992, 1 (12) :1691-1698
[29]   THE GREEK KEY MOTIF - EXTRACTION, CLASSIFICATION AND ANALYSIS [J].
HUTCHINSON, EG ;
THORNTON, JM .
PROTEIN ENGINEERING, 1993, 6 (03) :233-245
[30]   INFLUENCE OF NEAREST-NEIGHBOR AMINO-ACIDS ON CONFORMATION OF MIDDLE AMINO-ACID IN PROTEINS - COMPARISON OF PREDICTED AND EXPERIMENTAL DETERMINATION OF BETA-SHEETS IN CONCANAVALIN-A [J].
KABAT, EA ;
WU, TT .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1973, 70 (05) :1473-1477