Cascaded bidirectional recurrent neural networks for protein secondary structure prediction

被引:25
作者
Chen, Jinmiao [1 ]
Chaudhari, Narendra S. [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
关键词
protein secondary structure prediction; cascaded bidirectional recurrent neural networks; long-range interactions;
D O I
10.1109/TCBB.2007.1055
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein secondary structure (PSS) prediction is an important topic in bioinformatics. Our study on a large set of nonhomologous proteins shows that long-range interactions commonly exist and negatively affect PSS prediction. Besides, we also reveal strong correlations between secondary structure (SS) elements. In order to take into account the long-range interactions and SS-SS correlations, we propose a novel prediction system based on a cascaded bidirectional recurrent neural network (BRNN). We compare the cascaded BRNN against two other BRNN architectures, namely, the original BRNN architecture used for speech recognition and Pollastri's BRNN, which was proposed for PSS prediction. Our cascaded BRNN achieves an overall three-state accuracy Q3 of 74.38 percent and reaches a high Segment OVerlap (SOV) of 66.0455. It outperforms the original BRNN and Pollastri's BRNN in both Q3 and SOV. Specifically, it improves the SOV score by 4-6 percent.
引用
收藏
页码:572 / 582
页数:11
相关论文
共 32 条
[1]  
BALDI P, 2000, BIDIRECTIONAL IOHMMS
[2]  
Chen JX, 2006, INT C COMP SUPP COOP, P296
[3]   Protein secondary structure: entropy, correlations and prediction [J].
Crooks, GE ;
Brenner, SE .
BIOINFORMATICS, 2004, 20 (10) :1603-1611
[4]  
Cuff JA, 1999, PROTEINS, V34, P508, DOI 10.1002/(SICI)1097-0134(19990301)34:4<508::AID-PROT10>3.0.CO
[5]  
2-4
[6]  
Efron B., 1993, INTRO BOOTSTRAP MONO, DOI DOI 10.1201/9780429246593
[7]  
FRANCESCO VD, 1996, PROTEIN SCI, V5, P106
[8]   Knowledge-based protein secondary structure assignment [J].
Frishman, D ;
Argos, P .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 23 (04) :566-579
[9]   FURTHER DEVELOPMENTS OF PROTEIN SECONDARY STRUCTURE PREDICTION USING INFORMATION-THEORY - NEW PARAMETERS AND CONSIDERATION OF RESIDUE PAIRS [J].
GIBRAT, JF ;
GARNIER, J ;
ROBSON, B .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 198 (03) :425-443
[10]   Combining protein secondary structure prediction models with ensemble methods of optimal complexity [J].
Guermeur, Y ;
Pollastri, G ;
Elisseeff, A ;
Zelus, D ;
Paugam-Moisy, H ;
Baldi, P .
NEUROCOMPUTING, 2004, 56 :305-327