Porter: a new, accurate server for protein secondary structure prediction

被引:341
作者
Pollastri, G [1 ]
McLysaght, A
机构
[1] Natl Univ Ireland Univ Coll Dublin, Dept Comp Sci, Dublin 4, Ireland
[2] Univ Dublin Trinity Coll, Dept Genet, Dublin 2, Ireland
关键词
D O I
10.1093/bioinformatics/bti203
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Porter is a new system for protein secondary structure prediction in three classes. Porter relies on bidirectional recurrent neural networks with shortcut connections, accurate coding of input profiles obtained from multiple sequence alignments, second stage filtering by recurrent neural networks, incorporation of long range information and large-scale ensembles of predictors. Porter's accuracy, tested by rigorous 5-fold cross-validation on a large set of proteins, exceeds 79%, significantly above a copy of the state-of-the-art SSpro server, better than any system published to date.
引用
收藏
页码:1719 / 1720
页数:2
相关论文
共 12 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Exploiting the past and the future in protein secondary structure prediction [J].
Baldi, P ;
Brunak, S ;
Frasconi, P ;
Soda, G ;
Pollastri, G .
BIOINFORMATICS, 1999, 15 (11) :937-946
[3]   Rosetta predictions in CASP5: Successes, failures, and prospects for complete automation [J].
Bradley, P ;
Chivian, D ;
Meiler, J ;
Misura, KMS ;
Rohl, CA ;
Schief, WR ;
Wedemeyer, WJ ;
Schueler-Furman, O ;
Murphy, P ;
Schonbrun, J ;
Strauss, CEM ;
Baker, D .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 :457-468
[4]   GenTHREADER: An efficient and reliable protein fold recognition method for genomic sequences [J].
Jones, DT .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 287 (04) :797-815
[5]   Protein secondary structure prediction based on position-specific scoring matrices [J].
Jones, DT .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 292 (02) :195-202
[6]   DICTIONARY OF PROTEIN SECONDARY STRUCTURE - PATTERN-RECOGNITION OF HYDROGEN-BONDED AND GEOMETRICAL FEATURES [J].
KABSCH, W ;
SANDER, C .
BIOPOLYMERS, 1983, 22 (12) :2577-2637
[7]  
Lesk AM, 2001, PROTEINS, P98
[8]  
Petersen TN, 2000, PROTEINS, V41, P17, DOI 10.1002/1097-0134(20001001)41:1<17::AID-PROT40>3.3.CO
[9]  
2-6
[10]   Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles [J].
Pollastri, G ;
Przybylski, D ;
Rost, B ;
Baldi, P .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 47 (02) :228-235