Controlling an organic synthesis robot with machine learning to search for new reactivity

被引:523
作者
Granda, Jaroslaw M. [1 ]
Donina, Liva [1 ]
Dragone, Vincenza [1 ]
Long, De-Liang [1 ]
Cronin, Leroy [1 ]
机构
[1] Univ Glasgow, Sch Chem, Glasgow, Lanark, Scotland
基金
英国工程与自然科学研究理事会;
关键词
DISCOVERY; REPRESENTATION; DESIGN;
D O I
10.1038/s41586-018-0307-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The discovery of chemical reactions is an inherently unpredictable and time-consuming process(1). An attractive alternative is to predict reactivity, although relevant approaches, such as computer-aided reaction design, are still in their infancy(2). Reaction prediction based on high-level quantum chemical methods is complex(3) even for simple molecules. Although machine learning is powerful for data analysis(4,5), its applications in chemistry are still being developed'. Inspired by strategies based on chemists' intuition, we propose that a reaction system controlled by a machine learning algorithm may be able to explore the space of chemical reactions quickly, especially if trained by an expert(8). Here we present an organic synthesis robot that can perform chemical reactions and analysis faster than they can be performed manually, as well as predict the reactivity of possible reagent combinations after conducting a small number of experiments, thus effectively navigating chemical reaction space. By using machine learning for decision making, enabled by binary encoding of the chemical inputs, the reactions can be assessed in real time using nuclear magnetic resonance and infrared spectroscopy. The machine learning system was able to predict the reactivity of about 1,000 reaction combinations with accuracy greater than 80 per cent after considering the outcomes of slightly over 10 per cent of the dataset. This approach was also used to calculate the reactivity of published datasets. Further, by using real-time data from our robot, these predictions were followed up manually by a chemist, leading to the discovery of four reactions.
引用
收藏
页码:377 / +
页数:8
相关论文
共 24 条
[1]   Predicting reaction performance in C-N cross-coupling using machine learning [J].
Ahneman, Derek T. ;
Estrada, Jesus G. ;
Lin, Shishi ;
Dreher, Spencer D. ;
Doyle, Abigail G. .
SCIENCE, 2018, 360 (6385) :186-190
[2]   Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations? [J].
Bajusz, David ;
Racz, Anita ;
Heberger, Kroly .
JOURNAL OF CHEMINFORMATICS, 2015, 7
[3]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[4]  
Collins KD, 2014, NAT CHEM, V6, P859, DOI [10.1038/NCHEM.2062, 10.1038/nchem.2062]
[5]  
COOMANS D, 1978, ANAL CHIM ACTA-COMP, V2, P409
[6]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[7]   An autonomous organic reaction search engine for chemical reactivity [J].
Dragone, Vincenza ;
Sans, Victor ;
Henson, Alon B. ;
Granda, Jaroslaw M. ;
Cronin, Leroy .
NATURE COMMUNICATIONS, 2017, 8
[8]   Amplify scientific discovery with artificial intelligence [J].
Gil, Yolanda ;
Greaves, Mark ;
Hendler, James ;
Hirsh, Haym .
SCIENCE, 2014, 346 (6206) :171-172
[9]   Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules [J].
Gomez-Bombarelli, Rafael ;
Wei, Jennifer N. ;
Duvenaud, David ;
Hernandez-Lobato, Jose Miguel ;
Sanchez-Lengeling, Benjamin ;
Sheberla, Dennis ;
Aguilera-Iparraguirre, Jorge ;
Hirzel, Timothy D. ;
Adams, Ryan P. ;
Aspuru-Guzik, Alan .
ACS CENTRAL SCIENCE, 2018, 4 (02) :268-276
[10]   Heuristic thinking makes a chemist smart [J].
Graulich, Nicole ;
Hopf, Henning ;
Schreiner, Peter R. .
CHEMICAL SOCIETY REVIEWS, 2010, 39 (05) :1503-1512