Deep learning for molecular design-a review of the state of the art

被引：369

作者：

Elton, Daniel C. ^{[1
,3
]}

Boukouvalas, Zois ^{[1
,2
]}

Fuge, Mark D. ^{[1
]}

Chung, Peter W. ^{[1
]}

机构：

[1] Univ Maryland, Dept Mech Engn, College Pk, MD 20740 USA

[2] Amer Univ, Dept Math & Stat, Washington, DC 20016 USA

[3] NIH, Ctr Clin, Bethesda, MD 20892 USA

来源：

MOLECULAR SYSTEMS DESIGN & ENGINEERING | 2019年 / 4卷 / 04期

关键词：

SYNTHETIC ACCESSIBILITY; SELECTION CRITERIA; CHEMICAL LANGUAGE; DRUG DISCOVERY; NEURAL-NETWORK; CHEMISTRY; DATABASE; EXPLORATION; ALGORITHMS; GENERATION;

D O I：

10.1039/c9me00039a

中图分类号：

O64 [物理化学（理论化学）、化学物理学];

学科分类号：

070304 ; 081704 ;

摘要：

In the space of only a few years, deep generative modeling has revolutionized how we think of artificial creativity, yielding autonomous systems which produce original images, music, and text. Inspired by these successes, researchers are now applying deep generative modeling techniques to the generation and optimization of molecules-in our review we found 45 papers on the subject published in the past two years. These works point to a future where such systems will be used to generate lead molecules, greatly reducing resources spent downstream synthesizing and characterizing bad leads in the lab. In this review we survey the increasingly complex landscape of models and representation schemes that have been proposed. The four classes of techniques we describe are recursive neural networks, autoencoders, generative adversarial networks, and reinforcement learning. After first discussing some of the mathematical fundamentals of each technique, we draw high level connections and comparisons with other techniques and expose the pros and cons of each. Several important high level themes emerge as a result of this work, including the shift away from the SMILES string representation of molecules towards more sophisticated representations such as graph grammars and 3D representations, the importance of reward function design, the need for better standards for benchmarking and testing, and the benefits of adversarial training and reinforcement learning over maximum likelihood based training.

引用

页码：828 / 849

页数：22

共 189 条

[1] EnzyNet: enzyme classification using 3D convolutional neural networks on spatial representation
Amidi, Afshine
Amidi, Shervine
Vlachakis, Dimitrios
Megalooikonomou, Vasileios
Paragios, Nikos
Zacharaki, Evangelia, I
[J]. PEERJ, 2018, 6
[2] [Anonymous], 2018, 180609300 ARXIV
[3] [Anonymous], 181100628 ARXIV
[4] [Anonymous], BROAD I MODELS INFER
[5] [Anonymous], 180706156 ARXIV
[6] [Anonymous], ADV NEURAL INFORM PR
[7] [Anonymous], ASME 2016 INT DES EN
[8] [Anonymous], MOL CYCLEGAN GENERAT
[9] [Anonymous], 180509076 ARXIV
[10] [Anonymous], 161101144 ARXIV

← 1 2 3 4 5 6 7 8 9 10 →