Protein 8-class secondary structure prediction using conditional neural fields

被引:71
作者
Wang, Zhiyong [1 ]
Zhao, Feng [1 ]
Peng, Jian [1 ]
Xu, Jinbo [1 ]
机构
[1] Toyota Technol Inst, Chicago, IL USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Bioinformatics; Conditional neural fields; Eight class; Protein; Secondary structure prediction; SUPPORT VECTOR MACHINES; NETWORKS;
D O I
10.1002/pmic.201100196
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Compared with the protein 3-class secondary structure (SS) prediction, the 8-class prediction gains less attention and is also much more challenging, especially for proteins with few sequence homologs. This paper presents a new probabilistic method for 8-class SS prediction using conditional neural fields (CNFs), a recently invented probabilistic graphical model. This CNF method not only models the complex relationship between sequence features and SS, but also exploits the interdependency among SS types of adjacent residues. In addition to sequence profiles, our method also makes use of non-evolutionary information for SS prediction. Tested on the CB513 and RS126 data sets, our method achieves Q8 accuracy of 64.9 and 64.7%, respectively, which are much better than the SSpro8 web server (51.0 and 48.0%, respectively). Our method can also be used to predict other structure properties (e.g. solvent accessibility) of a protein or the SS of RNA.
引用
收藏
页码:3786 / 3792
页数:7
相关论文
共 37 条
[1]  
ASAI K, 1993, COMPUT APPL BIOSCI, V9, P141
[2]   Protein secondary structure prediction for a single-sequence using hidden semi-Markov models [J].
Aydin, Zafer ;
Altunbasak, Yucel ;
Borodovsky, Mark .
BMC BIOINFORMATICS, 2006, 7 (1)
[3]   Prediction of protein continuum secondary structure with probabilistic models based on NMR solved structures [J].
Bodén, M ;
Yuan, Z ;
Bailey, TL .
BMC BIOINFORMATICS, 2006, 7 (1)
[4]   Identifying sequence regions undergoing conformational change via predicted continuum secondary structure [J].
Boden, Mikael ;
Bailey, Timothy L. .
BIOINFORMATICS, 2006, 22 (15) :1809-1814
[5]   Improved residue contact prediction using support vector machines and a large feature set [J].
Cheng, Jianlin ;
Baldi, Pierre .
BMC BIOINFORMATICS, 2007, 8 (1)
[6]  
Cuff JA, 1999, PROTEINS, V34, P508, DOI 10.1002/(SICI)1097-0134(19990301)34:4<508::AID-PROT10>3.0.CO
[7]  
2-4
[8]   Mimicking the folding pathway to improve homology-free protein structure prediction [J].
DeBartolo, Joe ;
Colubri, Andres ;
Jha, Abhishek K. ;
Fitzgerald, James E. ;
Freed, Karl F. ;
Sosnick, Tobin R. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (10) :3734-3739
[9]   Position-specific residue preference features around the ends of helices and strands and a novel strategy for the prediction of secondary structures [J].
Duan, Mojie ;
Huang, Min ;
Ma, Chuang ;
Li, Lun ;
Zhou, Yanhong .
PROTEIN SCIENCE, 2008, 17 (09) :1505-1512
[10]   A novel method for protein secondary structure prediction using dual-layer SVM and profiles [J].
Guo, J ;
Chen, H ;
Sun, ZR ;
Lin, YL .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 54 (04) :738-743