A SIMPLE AND FAST APPROACH TO PREDICTION OF PROTEIN SECONDARY STRUCTURE FROM MULTIPLY ALIGNED SEQUENCES WITH ACCURACY ABOVE 70-PERCENT

被引:76
作者
MEHTA, PK [1 ]
HERINGA, J [1 ]
ARGOS, P [1 ]
机构
[1] EUROPEAN MOLEC BIOL LAB, D-69012 HEIDELBERG, GERMANY
关键词
AMINO ACID SEQUENCES; MULTIPLE SEQUENCE ALIGNMENT; PROTEIN SECONDARY STRUCTURE; SECONDARY STRUCTURE PREDICTION;
D O I
10.1002/pro.5560041208
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
To improve secondary structure predictions in protein sequences, the information residing in multiple sequence alignments of substituted but structurally related proteins is exploited. A database comprised of 70 protein families and a total of 2,500 sequences, some of which were aligned by tertiary structural superpositions, was used to calculate residue exchange weight matrices within alpha-helical, beta-strand, and coil substructures, respectively. Secondary structure predictions were made based on the observed residue substitutions in local regions of the multiple alignments and the largest possible associated exchange weights in each of the three matrix types. Comparison of the observed and predicted secondary structure on a per-residue basis yielded a mean accuracy of 72.2%. Individual alpha-helix, beta-strand, and coil states were respectively predicted at 66.4, 66.7, and 75.8% correctness, representing a well-balanced three-state prediction. The accuracy level, verified by cross-validation through jack-knife tests on all protein families, dropped, on average, to only 70.9%, indicating the rigor of the prediction procedure. On the basis of robustness, conceptual clarity, accuracy, and executable efficiency, the method has considerable advantage, especially with its sole reliance on amino acid substitutions within structurally related proteins.
引用
收藏
页码:2517 / 2525
页数:9
相关论文
共 32 条
[1]   AMINO-ACID-SEQUENCE ANALYSIS OF THE ANNEXIN SUPERGENE FAMILY OF PROTEINS [J].
BARTON, GJ ;
NEWMAN, RH ;
FREEMONT, PS ;
CRUMPTON, MJ .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1991, 198 (03) :749-760
[3]   PATTERNS OF DIVERGENCE IN HOMOLOGOUS PROTEINS AS INDICATORS OF SECONDARY AND TERTIARY STRUCTURE - A PREDICTION OF THE STRUCTURE OF THE CATALYTIC DOMAIN OF PROTEIN-KINASES [J].
BENNER, SA ;
GERLOFF, D .
ADVANCES IN ENZYME REGULATION, 1991, 31 :121-181
[4]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[5]   PREDICTION OF PROTEIN CONFORMATION [J].
CHOU, PY ;
FASMAN, GD .
BIOCHEMISTRY, 1974, 13 (02) :222-245
[6]   COMPARISON OF 3 ALGORITHMS FOR THE ASSIGNMENT OF SECONDARY STRUCTURE IN PROTEINS - THE ADVANTAGES OF A CONSENSUS ASSIGNMENT [J].
COLLOCH, N ;
ETCHEBEST, C ;
THOREAU, E ;
HENRISSAT, B ;
MORNON, JP .
PROTEIN ENGINEERING, 1993, 6 (04) :377-382
[7]   PREDICTION OF SECONDARY STRUCTURE BY EVOLUTIONARY COMPARISON - APPLICATION TO THE ALPHA-SUBUNIT OF TRYPTOPHAN SYNTHASE [J].
CRAWFORD, IP ;
NIERMANN, T ;
KIRSCHNER, K .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1987, 2 (02) :118-129
[8]   A COMPREHENSIVE SET OF SEQUENCE-ANALYSIS PROGRAMS FOR THE VAX [J].
DEVEREUX, J ;
HAEBERLI, P ;
SMITHIES, O .
NUCLEIC ACIDS RESEARCH, 1984, 12 (01) :387-395
[9]   THE PREDICTION AND ORIENTATION OF ALPHA-HELICES FROM SEQUENCE ALIGNMENTS - THE COMBINED USE OF ENVIRONMENT-DEPENDENT SUBSTITUTION TABLES, FOURIER-TRANSFORM METHODS AND HELIX CAPPING RULES [J].
DONNELLY, D ;
OVERINGTON, JP ;
BLUNDELL, TL .
PROTEIN ENGINEERING, 1994, 7 (05) :645-653
[10]   ANALYSIS OF ACCURACY AND IMPLICATIONS OF SIMPLE METHODS FOR PREDICTING SECONDARY STRUCTURE OF GLOBULAR PROTEINS [J].
GARNIER, J ;
OSGUTHORPE, DJ ;
ROBSON, B .
JOURNAL OF MOLECULAR BIOLOGY, 1978, 120 (01) :97-120