Position-specific residue preference features around the ends of helices and strands and a novel strategy for the prediction of secondary structures

被引:19
作者
Duan, Mojie [1 ]
Huang, Min [1 ]
Ma, Chuang [1 ]
Li, Lun [1 ]
Zhou, Yanhong [1 ]
机构
[1] Huazhong Univ Sci & Technol, Hubei Bioinformat & Mol Imaging Key Lab, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
secondary structure prediction; position-specific residue preference; ends of secondary structures; protein structure prediction;
D O I
10.1110/ps.035691.108
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
It has been many years since position-specific residue preference around the ends of a helix was revealed. However, all the existing secondary structure prediction methods did not exploit this preference feature, resulting in low accuracy in predicting the ends of secondary structures. In this study, we collected a relatively large data set consisting of 1860 high-resolution, non-homology proteins from the PDB, and further analyzed the residue distributions around the ends of regular secondary structures. It was found that there exist position-specific residue preferences (PSRP) around the ends of not only helices but also strands. Based on the unique features, we proposed a novel strategy and developed a tool named E-SSpred that treats the secondary structure as a whole and builds models to predict entire secondary structure segments directly by integrating relevant features. In E-SSpred, the support vector machine (SVM) method is adopted to model and predict the ends of helices and strands according to the unique residue distributions around them. A simple linear discriminate analysis method is applied to model and predict entire secondary structure segments by integrating end-prediction results, tri-peptide composition, and length distribution features of secondary structures, as well as the prediction results of the most famous program PSIPRED. The results of fivefold cross-validation on a widely used data set demonstrate that the accuracy of E-SSpred in predicting ends of secondary structures is about 10% higher than PSIPRED, and the overall prediction accuracy (Q(3) value) of E-SSpred (82.2%) is also better than PSIPRED (80.3%). The E-SSpred web server is available at http://bioinfo.hust.edu.cn/bio/tools/E-SSpred/index.html.
引用
收藏
页码:1505 / 1512
页数:8
相关论文
共 37 条
[1]   RULES FOR ALPHA-HELIX TERMINATION BY GLYCINE [J].
AURORA, R ;
SRINIVASAN, R ;
ROSE, GD .
SCIENCE, 1994, 264 (5162) :1126-1130
[2]   Protein structure prediction and structural genomics [J].
Baker, D ;
Sali, A .
SCIENCE, 2001, 294 (5540) :93-96
[3]   Is protein folding hierarchic? I. Local structure and peptide folding [J].
Baldwin, RL ;
Rose, GD .
TRENDS IN BIOCHEMICAL SCIENCES, 1999, 24 (01) :26-33
[4]   Prediction of the phenotypic effects of non-synonymous single nucleotide polymorphisms using structural and evolutionary information [J].
Bao, L ;
Cui, Y .
BIOINFORMATICS, 2005, 21 (10) :2185-2190
[5]   Predicting the functional consequences of non-synonymous single nucleotide polymorphisms: Structure-based assessment of amino acid variation [J].
Chasman, D ;
Adams, RM .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (02) :683-706
[6]   CONFORMATIONAL PARAMETERS FOR AMINO-ACIDS IN HELICAL, BETA-SHEET, AND RANDOM COIL REGIONS CALCULATED FROM PROTEINS [J].
CHOU, PY ;
FASMAN, GD .
BIOCHEMISTRY, 1974, 13 (02) :211-222
[7]  
Cuff JA, 1999, PROTEINS, V34, P508, DOI 10.1002/(SICI)1097-0134(19990301)34:4<508::AID-PROT10>3.0.CO
[8]  
2-4
[9]   Sequence comparison and protein structure prediction [J].
Dunbrack, Roland L., Jr. .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2006, 16 (03) :374-384
[10]   RETRACTED: Computational design of a biologically active enzyme (Retracted article. See vol 319, pg 569, 2008) [J].
Dwyer, MA ;
Looger, LL ;
Hellinga, HW .
SCIENCE, 2004, 304 (5679) :1967-1971