A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction

被引：20

作者：

Spencer, Matt ^{[1
]}

Eickholt, Jesse ^{[2
]}

Cheng, Jianlin ^{[3
]}

机构：

[1] Univ Missouri, Inst Informat, Columbia, MO 65211 USA

[2] Cent Michigan Univ, Dept Comp Sci, Mt Pleasant, MI 48859 USA

[3] Univ Missouri, Dept Comp Sci, Columbia, MO 65211 USA

来源：

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS | 2015年 / 12卷 / 01期

基金：

美国国家卫生研究院;

关键词：

Machine learning; neural nets; protein structure prediction; deep learning; NEURAL-NETWORKS; GENERATION; ACCURATE;

D O I：

10.1109/TCBB.2014.2343960

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Ab initio protein secondary structure (SS) predictions are utilized to generate tertiary structure predictions, which are increasingly demanded due to the rapid discovery of proteins. Although recent developments have slightly exceeded previous methods of SS prediction, accuracy has stagnated around 80 percent and many wonder if prediction cannot be advanced beyond this ceiling. Disciplines that have traditionally employed neural networks are experimenting with novel deep learning techniques in attempts to stimulate progress. Since neural networks have historically played an important role in SS prediction, we wanted to determine whether deep learning could contribute to the advancement of this field as well. We developed an SS predictor that makes use of the position-specific scoring matrix generated by PSI-BLAST and deep learning network architectures, which we call DNSS. Graphical processing units and CUDA software optimize the deep network architecture and efficiently train the deep networks. Optimal parameters for the training process were determined, and a workflow comprising three separately trained deep networks was constructed in order to make refined predictions. This deep learning network approach was used to predict SS for a fully independent test dataset of 198 proteins, achieving a Q(3) accuracy of 80.7 percent and a Sov accuracy of 74.2 percent.

引用

页码：103 / 112

页数：10

共 54 条

[41] Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles
Pollastri, G
Przybylski, D
Rost, B
Baldi, P
[J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 47 (02) : 228 - 235
[42] Improving fold recognition without folds
Przybylski, D
Rost, B
[J]. JOURNAL OF MOLECULAR BIOLOGY, 2004, 341 (01) : 255 - 269
[43] A Unified Multitask Architecture for Predicting Local Protein Properties
Qi, Yanjun
Oja, Merja
Weston, Jason
Noble, William Stafford
[J]. PLOS ONE, 2012, 7 (03):
[44] COMBINING EVOLUTIONARY INFORMATION AND NEURAL NETWORKS TO PREDICT PROTEIN SECONDARY STRUCTURE
ROST, B
SANDER, C
[J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1994, 19 (01) : 55 - 72
[45] Neural network-based face detection
Rowley, HA
Baluja, S
Kanade, T
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (01) : 23 - 38
[46] Smolensky P., 1986, Information processing in dynamical systems: Foundations of Harmony Theory,'' Parallel Distributed Processing: Exploration in the Microstructure of Cognition, V1
[47] Stork D. G., 1992, IJCNN International Joint Conference on Neural Networks (Cat. No.92CH3114-6), P289, DOI 10.1109/IJCNN.1992.226994
[48] ESpritz: accurate and fast prediction of protein disorder
Walsh, Ian
Martin, Alberto J. M.
Di Domenico, Tomas
Tosatto, Silvio C. E.
[J]. BIOINFORMATICS, 2012, 28 (04) : 503 - 509
[49] Protein 8-class secondary structure prediction using conditional neural fields
Wang, Zhiyong
Zhao, Feng
Peng, Jian
Xu, Jinbo
[J]. PROTEOMICS, 2011, 11 (19) : 3786 - 3792
[50] Secondary structure prediction with support vector machines
Ward, JJ
McGuffin, LJ
Buxton, BF
Jones, DT
[J]. BIOINFORMATICS, 2003, 19 (13) : 1650 - 1655

← 1 2 3 4 5 6 →