A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction

被引:20
作者
Spencer, Matt [1 ]
Eickholt, Jesse [2 ]
Cheng, Jianlin [3 ]
机构
[1] Univ Missouri, Inst Informat, Columbia, MO 65211 USA
[2] Cent Michigan Univ, Dept Comp Sci, Mt Pleasant, MI 48859 USA
[3] Univ Missouri, Dept Comp Sci, Columbia, MO 65211 USA
基金
美国国家卫生研究院;
关键词
Machine learning; neural nets; protein structure prediction; deep learning; NEURAL-NETWORKS; GENERATION; ACCURATE;
D O I
10.1109/TCBB.2014.2343960
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Ab initio protein secondary structure (SS) predictions are utilized to generate tertiary structure predictions, which are increasingly demanded due to the rapid discovery of proteins. Although recent developments have slightly exceeded previous methods of SS prediction, accuracy has stagnated around 80 percent and many wonder if prediction cannot be advanced beyond this ceiling. Disciplines that have traditionally employed neural networks are experimenting with novel deep learning techniques in attempts to stimulate progress. Since neural networks have historically played an important role in SS prediction, we wanted to determine whether deep learning could contribute to the advancement of this field as well. We developed an SS predictor that makes use of the position-specific scoring matrix generated by PSI-BLAST and deep learning network architectures, which we call DNSS. Graphical processing units and CUDA software optimize the deep network architecture and efficiently train the deep networks. Optimal parameters for the training process were determined, and a workflow comprising three separately trained deep networks was constructed in order to make refined predictions. This deep learning network approach was used to predict SS for a fully independent test dataset of 198 proteins, achieving a Q(3) accuracy of 80.7 percent and a Sov accuracy of 74.2 percent.
引用
收藏
页码:103 / 112
页数:10
相关论文
共 54 条
  • [41] Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles
    Pollastri, G
    Przybylski, D
    Rost, B
    Baldi, P
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 47 (02) : 228 - 235
  • [42] Improving fold recognition without folds
    Przybylski, D
    Rost, B
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2004, 341 (01) : 255 - 269
  • [43] A Unified Multitask Architecture for Predicting Local Protein Properties
    Qi, Yanjun
    Oja, Merja
    Weston, Jason
    Noble, William Stafford
    [J]. PLOS ONE, 2012, 7 (03):
  • [44] COMBINING EVOLUTIONARY INFORMATION AND NEURAL NETWORKS TO PREDICT PROTEIN SECONDARY STRUCTURE
    ROST, B
    SANDER, C
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1994, 19 (01) : 55 - 72
  • [45] Neural network-based face detection
    Rowley, HA
    Baluja, S
    Kanade, T
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (01) : 23 - 38
  • [46] Smolensky P., 1986, Information processing in dynamical systems: Foundations of Harmony Theory,'' Parallel Distributed Processing: Exploration in the Microstructure of Cognition, V1
  • [47] Stork D. G., 1992, IJCNN International Joint Conference on Neural Networks (Cat. No.92CH3114-6), P289, DOI 10.1109/IJCNN.1992.226994
  • [48] ESpritz: accurate and fast prediction of protein disorder
    Walsh, Ian
    Martin, Alberto J. M.
    Di Domenico, Tomas
    Tosatto, Silvio C. E.
    [J]. BIOINFORMATICS, 2012, 28 (04) : 503 - 509
  • [49] Protein 8-class secondary structure prediction using conditional neural fields
    Wang, Zhiyong
    Zhao, Feng
    Peng, Jian
    Xu, Jinbo
    [J]. PROTEOMICS, 2011, 11 (19) : 3786 - 3792
  • [50] Secondary structure prediction with support vector machines
    Ward, JJ
    McGuffin, LJ
    Buxton, BF
    Jones, DT
    [J]. BIOINFORMATICS, 2003, 19 (13) : 1650 - 1655