WEAK CORRELATION BETWEEN PREDICTIVE POWER OF INDIVIDUAL SEQUENCE PATTERNS AND OVERALL PREDICTION ACCURACY IN PROTEINS

Cited by: 27
Authors
ROOMAN, MJ [1 ]
WODAK, SJ [1 ]
Affiliation
[1] UNIV LIBRE BRUXELLES,UNITE CONFORMAT MACROMOLEC BIOL,CP 160,AV P HEGER,P2,B-1050 BRUSSELS,BELGIUM
Source
PROTEINS-STRUCTURE FUNCTION AND GENETICS | 1991 / Vol. 9 / No. 1
Keywords
STRUCTURE DATABASE; AMINO ACID PROPERTIES; EARLY FOLDING INTERMEDIATES; STABLE PEPTIDE STRUCTURES;
DOI
10.1002/prot.340090108
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Patterns in amino acid properties (polar, hydrophobic, etc.) that characterize secondary structure motifs are derived from a database containing 75 protein structures, with the aim of circumventing the limitations due to database size so as to increase the structure prediction score. Many such sequence-structure associations with high intrinsic predictive power are found, which turn out to be correct 78% of the time when applied individually to proteins outside the learning set. Based on these associations, a prediction method is developed, which reaches a score of 62% on the three states alpha-helix, beta-strand, and loop, without using additional constraints. Though this score is quite good compared to that of other available prediction methods, it is much lower than could be expected from the high intrinsic predictive power of the associations used. The reasons underlying this surprising result, which indicate that prediction score and intrinsic predictive power are only weakly coupled, are discussed. It is also shown that the size of the present database still seriously limits prediction scores, even when property patterns are used, and that higher scores are expected in larger databases. Clues are provided on the relative influence of neglecting spatial interactions on prediction efficiency, suggesting that, in sufficiently large databases, predicted secondary structures would correspond to those formed early in the folding process. This hypothesis is tested by comparing the present predictions with available experimental data on early protein folding intermediates and on small peptides that adopt a relatively stable conformation in water. Although admittedly there are still too few such data, results suggest that the hypothesis might be well founded.
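The abstract contrasts two quantities: how often an individual pattern is correct where it applies, and the overall three-state score over all residues. The Python sketch below is illustrative only and is not the authors' method. It assumes the conventional per-residue definition of a three-state score (fraction of residues assigned the correct state), which the abstract does not spell out. The state labels `H`, `E`, `L`, the `-` "no call" marker, the function names, and the toy sequences are all hypothetical.

```python
from typing import Tuple


def three_state_score(predicted: str, observed: str) -> float:
    """Fraction of residues whose predicted state equals the observed state.

    Assumed labels: H = alpha-helix, E = beta-strand, L = loop.
    """
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have equal length")
    if not observed:
        raise ValueError("sequences must not be empty")
    correct = sum(p == o for p, o in zip(predicted, observed))
    return correct / len(observed)


def pattern_precision(calls: str, observed: str, no_call: str = "-") -> Tuple[float, float]:
    """Return (precision, coverage) for a pattern that only calls some residues.

    precision: fraction of called residues that are correct.
    coverage:  fraction of all residues that received a call.
    """
    if len(calls) != len(observed):
        raise ValueError("calls and observed must have equal length")
    covered = [(c, o) for c, o in zip(calls, observed) if c != no_call]
    if not covered:
        raise ValueError("the pattern made no calls")
    precision = sum(c == o for c, o in covered) / len(covered)
    coverage = len(covered) / len(observed)
    return precision, coverage


# Toy data (invented for illustration, not taken from the paper).
observed = "HHHHLLEEEELL"
pattern_calls = "HHH-----E---"    # a pattern that fires on 4 of 12 residues
full_prediction = "HHHHHHLLEELL"  # a complete prediction covering every residue

precision, coverage = pattern_precision(pattern_calls, observed)
overall = three_state_score(full_prediction, observed)
print(f"pattern precision={precision:.2f}, coverage={coverage:.2f}")
print(f"overall three-state score={overall:.2f}")
```

In this toy case the pattern is always right where it fires but covers only a third of the residues, while the complete prediction scores lower. This shows only that the two numbers are computed over different residue sets; it does not reproduce the reasons for weak coupling discussed in the paper.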
Pages: 69-78
Page count: 10