Protein secondary structure prediction using local alignments

被引:86
作者
Salamov, AA [1 ]
Solovyev, VV [1 ]
机构
[1] BAYLOR COLL MED, DEPT CELL BIOL, HOUSTON, TX 77030 USA
关键词
protein secondary structure prediction; local alignments; 3D-1D environment scoring; substitution matrix;
D O I
10.1006/jmbi.1997.0958
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The accuracy of secondary structure prediction methods has been improved significantly by the use of aligned protein sequences. The PHD method and the NNSSP method reach 71 to 72% of sustained overall three-state accuracy when multiple sequence alignments are with neural networks and nearest-neighbor algorithms, respectively. We introduce a variant of the nearest-neighbor approach that can achieve similar accuracy using a single sequence as the query input. We compute the 50 best non-intersecting local alignments of the query sequence with each sequence from a set of proteins with known 3D structures. Each position of the query sequence is aligned with the database amino acids in alpha-helical, beta-strand or coil states. The prediction type of secondary structure is selected as the type of aligned position with the maximal total score. On the dataset of 124 non-membrane non-homologous proteins, used earlier as a benchmark for secondary structure predictions, our method reaches an overall three-state accuracy of 71.2%. The performance accuracy is verified by an additional test on 461 non-homologous proteins giving an accuracy of 71.0%. The main strength of the method is the high level of prediction accuracy for proteins without any known homolog. Using multiple sequence alignments as input the method has a prediction accuracy of 73.5%. Prediction of secondary structure by the SSPAL method is available via Baylor College of Medicine World Wide Web server. (C) 1997 Academic Press Limited.
引用
收藏
页码:31 / 36
页数:6
相关论文
共 34 条
[1]   PREDICTED SECONDARY STRUCTURE FOR THE SRC HOMOLOGY-3 DOMAIN [J].
BENNER, SA ;
COHEN, MA ;
GERLOFF, D .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 229 (02) :295-305
[2]  
BENNER SA, 1989, ADV ENZYME REGUL, V28, P219, DOI 10.1016/0065-2571(89)90073-3
[3]  
BENNER SA, 1992, CURR OPIN STRUC BIOL, V2, P402
[4]   SECONDARY STRUCTURE PREDICTION - COMBINATION OF 3 DIFFERENT METHODS [J].
BIOU, V ;
GIBRAT, JF ;
LEVIN, JM ;
ROBSON, B ;
GARNIER, J .
PROTEIN ENGINEERING, 1988, 2 (03) :185-191
[5]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[6]   EMPIRICAL PREDICTIONS OF PROTEIN CONFORMATION [J].
CHOU, PY ;
FASMAN, GD .
ANNUAL REVIEW OF BIOCHEMISTRY, 1978, 47 :251-276
[7]   PREDICTION OF SECONDARY STRUCTURE BY EVOLUTIONARY COMPARISON - APPLICATION TO THE ALPHA-SUBUNIT OF TRYPTOPHAN SYNTHASE [J].
CRAWFORD, IP ;
NIERMANN, T ;
KIRSCHNER, K .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1987, 2 (02) :118-129
[8]   EVALUATION OF CURRENT TECHNIQUES FOR AB-INITIO PROTEIN-STRUCTURE PREDICTION [J].
DEFAY, T ;
COHEN, FE .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1995, 23 (03) :431-445
[9]  
Fischer D, 1996, PROTEIN SCI, V5, P947
[10]   EXHAUSTIVE MATCHING OF THE ENTIRE PROTEIN-SEQUENCE DATABASE [J].
GONNET, GH ;
COHEN, MA ;
BENNER, SA .
SCIENCE, 1992, 256 (5062) :1443-1445