REDEFINING THE GOALS OF PROTEIN SECONDARY STRUCTURE PREDICTION

被引:274
作者
ROST, B
SANDER, C
SCHNEIDER, R
机构
[1] EMBL Heidelberg Meyerhofstraße 1
关键词
SECONDARY STRUCTURE PREDICTION; PREDICTION ACCURACY; SECONDARY STRUCTURE SEGMENTS; EVALUATION; HOMOLOGOUS PROTEINS;
D O I
10.1016/S0022-2836(05)80007-5
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Secondary structure prediction recently has surpassed the 70% level of average accuracy, evaluated on the single residue states helix, strand and loop (Q3). But the ultimate goal is reliable prediction of tertiary (three-dimensional, 3D) structure, not 100% single residue accuracy for secondary structure. A comparison of pairs of structurally homologous proteins with divergent sequences reveals that considerable variation in the position and length of secondary structure segments can be accommodated within the same 3D fold. It is therefore sufficient to predict the approximate location of helix, strand, turn and loop segments, provided they are compatible with the formation of 3D structure. Accordingly, we define here a measure of segment overlap (Sov) that is somewhat insensitive to small variations in secondary structure assignments. The new segment overlap measure ranges from an ignorance level of 37% (random protein pairs) via a current level of 72% for a prediction method based on sequence profile input to neural networks (PHD) to an average 90% level for homologous protein pairs. We conclude that the highest scores one can reasonably expect for secondary structure prediction are a single residue accuracy of Q3 > 85% and a fractional segment overlap of Sov > 90%. © 1994 Academic Press Limited.
引用
收藏
页码:13 / 26
页数:14
相关论文
共 89 条
[1]  
Anfinsen C B, 1975, Adv Protein Chem, V29, P205, DOI 10.1016/S0065-3233(08)60413-1
[2]   PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS [J].
ANFINSEN, CB .
SCIENCE, 1973, 181 (4096) :223-230
[3]   THE SWISS-PROT PROTEIN-SEQUENCE DATA-BANK [J].
BAIROCH, A ;
BOECKMANN, B .
NUCLEIC ACIDS RESEARCH, 1992, 20 :2019-2022
[4]   POLARITY AS A CRITERION IN PROTEIN DESIGN [J].
BAUMANN, G ;
FROMMEL, C ;
SANDER, C .
PROTEIN ENGINEERING, 1989, 2 (05) :329-334
[5]   PREDICTED SECONDARY STRUCTURE FOR THE SRC HOMOLOGY-3 DOMAIN [J].
BENNER, SA ;
COHEN, MA ;
GERLOFF, D .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 229 (02) :295-305
[6]  
BENNER SA, 1992, CURR OPIN STRUC BIOL, V2, P402
[7]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[8]   SECONDARY STRUCTURE PREDICTION - COMBINATION OF 3 DIFFERENT METHODS [J].
BIOU, V ;
GIBRAT, JF ;
LEVIN, JM ;
ROBSON, B ;
GARNIER, J .
PROTEIN ENGINEERING, 1988, 2 (03) :185-191
[9]  
BRANDEN C, 1991, INTRO PROTEIN STRUCT
[10]   BETWEEN OBJECTIVITY AND SUBJECTIVITY [J].
BRANDEN, CI ;
JONES, TA .
NATURE, 1990, 343 (6260) :687-689