Using Machine Learning To Predict Suitable Conditions for Organic Reactions

被引:292
作者
Gao, Hanyu [1 ]
Struble, Thomas J. [1 ]
Coley, Connor W. [1 ]
Wang, Yuran [1 ]
Green, William H. [1 ]
Jensen, Klavs F. [1 ]
机构
[1] MIT, Dept Chem Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
AIDED SYNTHESIS DESIGN; CHEMICAL-REACTIONS; OPTIMIZATION; SINGLE; SYSTEM; FLOW; TOOL;
D O I
10.1021/acscentsci.8b00357
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Reaction condition recommendation is an essential element for the realization of computer-assisted synthetic planning. Accurate suggestions of reaction conditions are required for experimental validation and can have a significant effect on the success or failure of an attempted transformation. However, de novo condition recommendation remains a challenging and under-explored problem and relies heavily on chemists' knowledge and experience. In this work, we develop a neural-network model to predict the chemical context (catalyst(s), solvent(s), reagent(s)), as well as the temperature most suitable for any particular organic reaction. Trained on similar to 10 million examples from Reaxys, the model is able to propose conditions where a close match to the recorded catalyst, solvent, and reagent is found within the top-10 predictions 69.6% of the time, with top-10 accuracies for individual species reaching 80-90%. Temperature is accurately predicted within +/- 20 degrees C from the recorded temperature in 60-70% of test cases, with higher accuracy for cases with correct chemical context predictions. The utility of the model is illustrated through several examples spanning a range of common reaction classes. We also demonstrate that the model implicitly learns a continuous numerical embedding of solvent and reagent species that captures their functional similarity.
引用
收藏
页码:1465 / 1476
页数:12
相关论文
共 48 条
[1]  
[Anonymous], 2017, ARXIV170804202
[2]  
[Anonymous], 2017, ARXIV171104810
[3]  
[Anonymous], 2014, Patent reaction extraction: downloads
[4]   Optimum catalyst selection over continuous and discrete process variables with a single droplet microfluidic reaction platform [J].
Baumgartner, Lorenz M. ;
Coley, Connor W. ;
Reizman, Brandon J. ;
Gao, Kevin W. ;
Jensen, Klavs F. .
REACTION CHEMISTRY & ENGINEERING, 2018, 3 (03) :301-311
[5]   Route Design in the 21st Century: The ICSYNTH Software Tool as an Idea Generator for Synthesis Prediction [J].
Bogevig, Anders ;
Federsel, Hans-Juergen ;
Huerta, Fernando ;
Hutchings, Michael G. ;
Kraut, Hans ;
Langer, Thomas ;
Loew, Peter ;
Oppawsky, Christoph ;
Rein, Tobias ;
Saller, Heinz .
ORGANIC PROCESS RESEARCH & DEVELOPMENT, 2015, 19 (02) :357-368
[6]   Machine Learning in Computer-Aided Synthesis Planning [J].
Coley, Connor W. ;
Green, William H. ;
Jensen, Klays F. .
ACCOUNTS OF CHEMICAL RESEARCH, 2018, 51 (05) :1281-1289
[7]   Computer-Assisted Retrosynthesis Based on Molecular Similarity [J].
Coley, Connor W. ;
Rogers, Luke ;
Green, William H. ;
Jensen, Klavs F. .
ACS CENTRAL SCIENCE, 2017, 3 (12) :1237-1245
[8]   Prediction of Organic Reaction Outcomes Using Machine Learning [J].
Coley, Connor W. ;
Barzilay, Regina ;
Jaakkola, Tommi S. ;
Green, William H. ;
Jensen, Klays F. .
ACS CENTRAL SCIENCE, 2017, 3 (05) :434-443
[9]   Computer-aided synthesis design: 40 years on [J].
Cook, Anthony ;
Johnson, A. Peter ;
Law, James ;
Mirzazadeh, Mahdi ;
Ravitz, Orr ;
Simon, Aniko .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2012, 2 (01) :79-107
[10]   COMPUTER-ASSISTED DESIGN OF COMPLEX ORGANIC SYNTHESES [J].
COREY, EJ ;
WIPKE, WT .
SCIENCE, 1969, 166 (3902) :178-&