Planning chemical syntheses with deep neural networks and symbolic AI

被引:1183
作者
Segler, Marwin H. S. [1 ,2 ,3 ]
Preuss, Mike [4 ]
Waller, Mark P. [5 ,6 ]
机构
[1] Westfalische Wilhelms Univ, Inst Organ Chem, Munster, Germany
[2] Westfalische Wilhelms Univ, Ctr Multiscale Theory & Computat, Munster, Germany
[3] BenevolentAI, London, England
[4] Westfalische Wilhelms Univ Munster, European Res Ctr Informat Syst, Munster, Germany
[5] Shanghai Univ, Dept Phys, Shanghai, Peoples R China
[6] Shanghai Univ, Int Ctr Quantum & Mol Struct, Shanghai, Peoples R China
关键词
ORGANIC-CHEMISTRY; KNOWLEDGE-BASE; SYSTEM; CLASSIFICATION; PREDICTION; REACTIVITY; DESIGN; ROUTE; RETROSYNTHESIS; DISCOVERY;
D O I
10.1038/nature25978
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
To plan the syntheses of small organic molecules, chemists use retrosynthesis, a problem-solving technique in which target molecules are recursively transformed into increasingly simpler precursors. Computer-aided retrosynthesis would be a valuable tool but at present it is slow and provides results of unsatisfactory quality. Here we use Monte Carlo tree search and symbolic artificial intelligence (AI) to discover retrosynthetic routes. We combined Monte Carlo tree search with an expansion policy network that guides the search, and a filter network to pre-select the most promising retrosynthetic steps. These deep neural networks were trained on essentially all reactions ever published in organic chemistry. Our system solves for almost twice as many molecules, thirty times faster than the traditional computer-aided search method, which is based on extracted rules and hand-designed heuristics. In a double-blind AB test, chemists on average considered our computer-generated routes to be equivalent to reported literature routes.
引用
收藏
页码:604 / +
页数:16
相关论文
共 76 条
  • [11] Chollet F., 2015, about us
  • [12] Mining Electronic Laboratory Notebooks: Analysis, Retrosynthesis, and Reaction Based Enumeration
    Christ, Clara D.
    Zentgraf, Matthias
    Kriegl, Jan M.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2012, 52 (07) : 1745 - 1756
  • [13] Clark C, 2015, PR MACH LEARN RES, V37, P1766
  • [14] Clayden J., 2008, Organic Chemistry
  • [15] Computer-Assisted Retrosynthesis Based on Molecular Similarity
    Coley, Connor W.
    Rogers, Luke
    Green, William H.
    Jensen, Klavs F.
    [J]. ACS CENTRAL SCIENCE, 2017, 3 (12) : 1237 - 1245
  • [16] Prediction of Organic Reaction Outcomes Using Machine Learning
    Coley, Connor W.
    Barzilay, Regina
    Jaakkola, Tommi S.
    Green, William H.
    Jensen, Klays F.
    [J]. ACS CENTRAL SCIENCE, 2017, 3 (05) : 434 - 443
  • [17] Collins KD, 2013, NAT CHEM, V5, P597, DOI [10.1038/NCHEM.1669, 10.1038/nchem.1669]
  • [18] Computer-aided synthesis design: 40 years on
    Cook, Anthony
    Johnson, A. Peter
    Law, James
    Mirzazadeh, Mahdi
    Ravitz, Orr
    Simon, Aniko
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2012, 2 (01) : 79 - 107
  • [19] Corey E. J., 1989, the Logic of Chemical Synthesis
  • [20] Coulom R, 2007, ICGA J, V30, P198