A graph-convolutional neural network model for the prediction of chemical reactivity

被引:468
作者
Coley, Connor W. [1 ]
Jin, Wengong [2 ]
Rogers, Luke [1 ]
Jamison, Timothy F. [3 ]
Jaakkola, Tommi S. [2 ]
Green, William H. [1 ]
Barzilay, Regina [2 ]
Jensen, Klavs F. [1 ]
机构
[1] MIT, Dept Chem Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[3] MIT, Dept Chem, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
AIDED SYNTHESIS DESIGN; ORGANIC-REACTIONS; RETROSYNTHESIS; SYSTEM; TOOL;
D O I
10.1039/c8sc04228d
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We present a supervised learning approach to predict the products of organic reactions given their reactants, reagents, and solvent(s). The prediction task is factored into two stages comparable to manual expert approaches: considering possible sites of reactivity and evaluating their relative likelihoods. By training on hundreds of thousands of reaction precedents covering a broad range of reaction types from the patent literature, the neural model makes informed predictions of chemical reactivity. The model predicts the major product correctly over 85% of the time requiring around 100 ms per example, a significantly higher accuracy than achieved by previous machine learning approaches, and performs on par with expert chemists with years of formal training. We gain additional insight into predictions via the design of the neural model, revealing an understanding of chemistry qualitatively consistent with manual approaches.
引用
收藏
页码:370 / 377
页数:8
相关论文
共 39 条
  • [1] Predicting reaction performance in C-N cross-coupling using machine learning
    Ahneman, Derek T.
    Estrada, Jesus G.
    Lin, Shishi
    Dreher, Spencer D.
    Doyle, Abigail G.
    [J]. SCIENCE, 2018, 360 (6385) : 186 - 190
  • [2] [Anonymous], ARXIV170509037
  • [3] [Anonymous], 2017, ARXIV171104810
  • [4] [Anonymous], 2014, Patent reaction extraction: downloads
  • [5] Bahdanau Dzmitry, 2015, 3 INT C LEARN REPR I
  • [6] APPLICATIONS OF GRAPH-THEORY IN CHEMISTRY
    BALABAN, AT
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (03): : 334 - 343
  • [7] Route Design in the 21st Century: The ICSYNTH Software Tool as an Idea Generator for Synthesis Prediction
    Bogevig, Anders
    Federsel, Hans-Juergen
    Huerta, Fernando
    Hutchings, Michael G.
    Kraut, Hans
    Langer, Thomas
    Loew, Peter
    Oppawsky, Christoph
    Rein, Tobias
    Saller, Heinz
    [J]. ORGANIC PROCESS RESEARCH & DEVELOPMENT, 2015, 19 (02) : 357 - 368
  • [8] BRADSHAW J, 2018, ARXIV180510970PHYSIC
  • [9] Mining Electronic Laboratory Notebooks: Analysis, Retrosynthesis, and Reaction Based Enumeration
    Christ, Clara D.
    Zentgraf, Matthias
    Kriegl, Jan M.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2012, 52 (07) : 1745 - 1756
  • [10] Machine Learning in Computer-Aided Synthesis Planning
    Coley, Connor W.
    Green, William H.
    Jensen, Klays F.
    [J]. ACCOUNTS OF CHEMICAL RESEARCH, 2018, 51 (05) : 1281 - 1289