Deep learning for computational chemistry

被引:547
作者
Goh, Garrett B. [1 ]
Hodas, Nathan O. [1 ]
Vishnu, Abhinav [1 ]
机构
[1] Pacific Northwest Natl Lab, Adv Comp Math & Data Div, 902 Battelle Blvd, Richland, WA 99354 USA
关键词
deep learning; machine learning; molecular modeling; quantitative structure activity relationship; quantum chemistry; materials genome; toxicology; protein structure prediction; cheminformatics; artificial intelligence; SECONDARY STRUCTURE PREDICTION; ACCESSIBLE SURFACE-AREA; SUPPORT VECTOR MACHINES; PROTEIN-STRUCTURE; NEURAL-NETWORKS; PHYSICAL-PROPERTIES; NEW-GENERATION; RANDOM FOREST; GAPPED BLAST; FORCE-FIELD;
D O I
10.1002/jcc.24764
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The rise and fall of artificial neural networks is well documented in the scientific literature of both computer science and computational chemistry. Yet almost two decades later, we are now seeing a resurgence of interest in deep learning, a machine learning algorithm based on multilayer neural networks. Within the last few years, we have seen the transformative impact of deep learning in many domains, particularly in speech recognition and computer vision, to the extent that the majority of expert practitioners in those field are now regularly eschewing prior established models in favor of deep learning models. In this review, we provide an introductory overview into the theory of deep neural networks and their unique properties that distinguish them from traditional machine learning algorithms used in cheminformatics. By providing an overview of the variety of emerging applications of deep neural networks, we highlight its ubiquity and broad applicability to a wide range of challenges in the field, including quantitative structure activity relationship, virtual screening, protein structure prediction, quantum chemistry, materials design, and property prediction. In reviewing the performance of deep neural networks, we observed a consistent outperformance against non-neural networks state-of-the-art models across disparate research topics, and deep neural network-based models often exceeded the glass ceiling expectations of their respective tasks. Coupled with the maturity of GPU-accelerated computing for training deep neural networks and the exponential growth of chemical data on which to train these networks on, we anticipate that deep learning algorithms will be a valuable tool for computational chemistry. (c) 2017 Wiley Periodicals, Inc.
引用
收藏
页码:1291 / 1307
页数:17
相关论文
共 145 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] Consensus Modeling for HTS Assays Using In silico Descriptors Calculates the Best Balanced Accuracy in Tox21 Challenge
    Abdelaziz, Ahmed
    Spahn-Langguth, Hilde
    Schramm, Karl-Werner
    Tetko, Igor, V
    [J]. FRONTIERS IN ENVIRONMENTAL SCIENCE, 2016, 4
  • [3] Can we learn to distinguish between "drug-like" and "nondrug-like" molecules?
    Ajay
    Walters, WP
    Murcko, MA
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 1998, 41 (18) : 3314 - 3324
  • [4] Al-Rfou R., 2016, Theano: A Python framework for fast computation of mathematical expressions, V472, P473
  • [5] Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning
    Alipanahi, Babak
    Delong, Andrew
    Weirauch, Matthew T.
    Frey, Brendan J.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (08) : 831 - +
  • [6] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [7] [Anonymous], 2016, WHY SHOULD I TRUST Y
  • [8] [Anonymous], 2016, On the expressive power of deep neural networks
  • [9] [Anonymous], 2014, NIPS WORKSH DEEP LEA
  • [10] [Anonymous], 2016, P C ASS MACH TRANSL