Planning chemical syntheses with deep neural networks and symbolic AI

被引:1183
作者
Segler, Marwin H. S. [1 ,2 ,3 ]
Preuss, Mike [4 ]
Waller, Mark P. [5 ,6 ]
机构
[1] Westfalische Wilhelms Univ, Inst Organ Chem, Munster, Germany
[2] Westfalische Wilhelms Univ, Ctr Multiscale Theory & Computat, Munster, Germany
[3] BenevolentAI, London, England
[4] Westfalische Wilhelms Univ Munster, European Res Ctr Informat Syst, Munster, Germany
[5] Shanghai Univ, Dept Phys, Shanghai, Peoples R China
[6] Shanghai Univ, Int Ctr Quantum & Mol Struct, Shanghai, Peoples R China
关键词
ORGANIC-CHEMISTRY; KNOWLEDGE-BASE; SYSTEM; CLASSIFICATION; PREDICTION; REACTIVITY; DESIGN; ROUTE; RETROSYNTHESIS; DISCOVERY;
D O I
10.1038/nature25978
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
To plan the syntheses of small organic molecules, chemists use retrosynthesis, a problem-solving technique in which target molecules are recursively transformed into increasingly simpler precursors. Computer-aided retrosynthesis would be a valuable tool but at present it is slow and provides results of unsatisfactory quality. Here we use Monte Carlo tree search and symbolic artificial intelligence (AI) to discover retrosynthetic routes. We combined Monte Carlo tree search with an expansion policy network that guides the search, and a filter network to pre-select the most promising retrosynthetic steps. These deep neural networks were trained on essentially all reactions ever published in organic chemistry. Our system solves for almost twice as many molecules, thirty times faster than the traditional computer-aided search method, which is based on extracted rules and hand-designed heuristics. In a double-blind AB test, chemists on average considered our computer-generated routes to be equivalent to reported literature routes.
引用
收藏
页码:604 / +
页数:16
相关论文
共 76 条
  • [51] Extended-Connectivity Fingerprints
    Rogers, David
    Hahn, Mathew
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (05) : 742 - 754
  • [52] HORACE - AN AUTOMATIC SYSTEM FOR THE HIERARCHICAL-CLASSIFICATION OF CHEMICAL-REACTIONS
    ROSE, JR
    GASTEIGER, J
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (01): : 74 - 90
  • [53] Multi-armed bandits with episode context
    Rosin, Christopher D.
    [J]. ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2011, 61 (03) : 203 - 230
  • [54] SOPHIA, A KNOWLEDGE BASE-GUIDED REACTION PREDICTION SYSTEM - UTILIZATION OF A KNOWLEDGE-BASE DERIVED FROM A REACTION DATABASE
    SATOH, H
    FUNATSU, K
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1995, 35 (01): : 34 - 44
  • [55] Schneider N., 2015, J CHEM INF MOD, V55
  • [56] De Novo Design at the Edge of Chaos
    Schneider, Petra
    Schneidert, Gisbert
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2016, 59 (09) : 4077 - 4086
  • [57] Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks
    Segler, Marwin H. S.
    Kogej, Thierry
    Tyrchan, Christian
    Waller, Mark P.
    [J]. ACS CENTRAL SCIENCE, 2018, 4 (01) : 120 - 131
  • [58] Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction
    Segler, Marwin H. S.
    Waller, Mark P.
    [J]. CHEMISTRY-A EUROPEAN JOURNAL, 2017, 23 (25) : 5966 - 5971
  • [59] Modelling Chemical Reasoning to Predict and Invent Reactions
    Segler, Marwin H. S.
    Waller, Mark P.
    [J]. CHEMISTRY-A EUROPEAN JOURNAL, 2017, 23 (25) : 6118 - 6128
  • [60] Time-Split Cross-Validation as a Method for Estimating the Goodness of Prospective Prediction
    Sheridan, Robert P.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (04) : 783 - 790