REDEFINING THE GOALS OF PROTEIN SECONDARY STRUCTURE PREDICTION

被引:274
作者
ROST, B
SANDER, C
SCHNEIDER, R
机构
[1] EMBL Heidelberg Meyerhofstraße 1
关键词
SECONDARY STRUCTURE PREDICTION; PREDICTION ACCURACY; SECONDARY STRUCTURE SEGMENTS; EVALUATION; HOMOLOGOUS PROTEINS;
D O I
10.1016/S0022-2836(05)80007-5
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Secondary structure prediction recently has surpassed the 70% level of average accuracy, evaluated on the single residue states helix, strand and loop (Q3). But the ultimate goal is reliable prediction of tertiary (three-dimensional, 3D) structure, not 100% single residue accuracy for secondary structure. A comparison of pairs of structurally homologous proteins with divergent sequences reveals that considerable variation in the position and length of secondary structure segments can be accommodated within the same 3D fold. It is therefore sufficient to predict the approximate location of helix, strand, turn and loop segments, provided they are compatible with the formation of 3D structure. Accordingly, we define here a measure of segment overlap (Sov) that is somewhat insensitive to small variations in secondary structure assignments. The new segment overlap measure ranges from an ignorance level of 37% (random protein pairs) via a current level of 72% for a prediction method based on sequence profile input to neural networks (PHD) to an average 90% level for homologous protein pairs. We conclude that the highest scores one can reasonably expect for secondary structure prediction are a single residue accuracy of Q3 > 85% and a fractional segment overlap of Sov > 90%. © 1994 Academic Press Limited.
引用
收藏
页码:13 / 26
页数:14
相关论文
共 89 条
[11]  
BURGESS AW, 1974, ISRAEL J CHEM, V12, P239
[12]   THE RELATION BETWEEN THE DIVERGENCE OF SEQUENCE AND STRUCTURE IN PROTEINS [J].
CHOTHIA, C ;
LESK, AM .
EMBO JOURNAL, 1986, 5 (04) :823-826
[13]  
Chou P Y, 1978, Adv Enzymol Relat Areas Mol Biol, V47, P45
[14]   CONFORMATIONAL PARAMETERS FOR AMINO-ACIDS IN HELICAL, BETA-SHEET, AND RANDOM COIL REGIONS CALCULATED FROM PROTEINS [J].
CHOU, PY ;
FASMAN, GD .
BIOCHEMISTRY, 1974, 13 (02) :211-222
[15]   TURN PREDICTION IN PROTEINS USING A PATTERN-MATCHING APPROACH [J].
COHEN, FE ;
ABARBANEL, RM ;
KUNTZ, ID ;
FLETTERICK, RJ .
BIOCHEMISTRY, 1986, 25 (01) :266-275
[16]   SECONDARY STRUCTURE ASSIGNMENT FOR ALPHA-BETA-PROTEINS BY A COMBINATORIAL APPROACH [J].
COHEN, FE ;
ABARBANEL, RM ;
KUNTZ, ID ;
FLETTERICK, RJ .
BIOCHEMISTRY, 1983, 22 (21) :4894-4904
[17]  
COHEN FE, 1989, PREDICTION PROTEIN S, P647
[18]   COMPARISON OF 3 ALGORITHMS FOR THE ASSIGNMENT OF SECONDARY STRUCTURE IN PROTEINS - THE ADVANTAGES OF A CONSENSUS ASSIGNMENT [J].
COLLOCH, N ;
ETCHEBEST, C ;
THOREAU, E ;
HENRISSAT, B ;
MORNON, JP .
PROTEIN ENGINEERING, 1993, 6 (04) :377-382
[19]   GENETIC CONTROL OF TERTIARY PROTEIN STUCTURE - STUDIES WITH MODEL SYSTEMS [J].
EPSTEIN, CJ ;
GOLDBERGER, RF ;
ANFINSEN, CB .
COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY, 1963, 28 :439-&
[20]   STATISTICAL ANALYSIS OF CORRELATION AMONG AMINO ACID RESIDUES IN HELICAL, BETA-STRUCTURAL AND NON-REGULAR REGIONS OF GLOBULAR PROTEINS [J].
FINKELSTEIN, AV ;
PTITSYN, OB .
JOURNAL OF MOLECULAR BIOLOGY, 1971, 62 (03) :613-+