Protein 8-class secondary structure prediction using conditional neural fields

被引:71
作者
Wang, Zhiyong [1 ]
Zhao, Feng [1 ]
Peng, Jian [1 ]
Xu, Jinbo [1 ]
机构
[1] Toyota Technol Inst, Chicago, IL USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Bioinformatics; Conditional neural fields; Eight class; Protein; Secondary structure prediction; SUPPORT VECTOR MACHINES; NETWORKS;
D O I
10.1002/pmic.201100196
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Compared with the protein 3-class secondary structure (SS) prediction, the 8-class prediction gains less attention and is also much more challenging, especially for proteins with few sequence homologs. This paper presents a new probabilistic method for 8-class SS prediction using conditional neural fields (CNFs), a recently invented probabilistic graphical model. This CNF method not only models the complex relationship between sequence features and SS, but also exploits the interdependency among SS types of adjacent residues. In addition to sequence profiles, our method also makes use of non-evolutionary information for SS prediction. Tested on the CB513 and RS126 data sets, our method achieves Q8 accuracy of 64.9 and 64.7%, respectively, which are much better than the SSpro8 web server (51.0 and 48.0%, respectively). Our method can also be used to predict other structure properties (e.g. solvent accessibility) of a protein or the SS of RNA.
引用
收藏
页码:3786 / 3792
页数:7
相关论文
共 37 条
[21]   Preorganized secondary structure as an important determinant of fast protein folding [J].
Myers, JK ;
Oas, TG .
NATURE STRUCTURAL BIOLOGY, 2001, 8 (06) :552-558
[22]   THE STRUCTURE OF PROTEINS - 2 HYDROGEN-BONDED HELICAL CONFIGURATIONS OF THE POLYPEPTIDE CHAIN [J].
PAULING, L ;
COREY, RB ;
BRANSON, HR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1951, 37 (04) :205-211
[23]  
Peng J., 2009, Advances in Neural Information Processing Systems, P1419
[24]   Low-homology protein threading [J].
Peng, Jian ;
Xu, Jinbo .
BIOINFORMATICS, 2010, 26 (12) :i294-i300
[25]  
Pirovano W, 2010, METHODS MOL BIOL, V609, P327, DOI 10.1007/978-1-60327-241-4_19
[26]   Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles [J].
Pollastri, G ;
Przybylski, D ;
Rost, B ;
Baldi, P .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 47 (02) :228-235
[27]   PREDICTING THE SECONDARY STRUCTURE OF GLOBULAR-PROTEINS USING NEURAL NETWORK MODELS [J].
Qian, N ;
SEJNOWSKI, TJ .
JOURNAL OF MOLECULAR BIOLOGY, 1988, 202 (04) :865-884
[28]   A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286
[29]   COMBINING EVOLUTIONARY INFORMATION AND NEURAL NETWORKS TO PREDICT PROTEIN SECONDARY STRUCTURE [J].
ROST, B ;
SANDER, C .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1994, 19 (01) :55-72
[30]   REDEFINING THE GOALS OF PROTEIN SECONDARY STRUCTURE PREDICTION [J].
ROST, B ;
SANDER, C ;
SCHNEIDER, R .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 235 (01) :13-26