Machine-learning-assisted materials discovery using failed experiments

被引:1263
作者
Raccuglia, Paul [1 ]
Elbert, Katherine C. [1 ]
Adler, Philip D. F. [1 ]
Falk, Casey [1 ]
Wenny, Malia B. [1 ]
Mollo, Aurelio [1 ]
Zeller, Matthias [2 ]
Friedler, Sorelle A. [1 ]
Schrier, Joshua [1 ]
Norquist, Alexander J. [1 ]
机构
[1] Haverford Coll, 370 Lancaster Ave, Haverford, PA 19041 USA
[2] Purdue Univ, Dept Chem, 560 Oval Dr, W Lafayette, IN 47907 USA
基金
美国国家科学基金会;
关键词
METAL-ORGANIC FRAMEWORKS; CRYSTAL-STRUCTURE; SELENITES; SOLIDS;
D O I
10.1038/nature17439
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Inorganic-organic hybrid materials(1-3) such as organically templated metal oxides(1), metal-organic frameworks (MOFs)(2) and organohalide perovskites(4) have been studied for decades, and hydrothermal and (non-aqueous) solvothermal syntheses have produced thousands of new materials that collectively contain nearly all the metals in the periodic table(5-9). Nevertheless, the formation of these compounds is not fully understood, and development of new compounds relies primarily on exploratory syntheses. Simulation-and data-driven approaches (promoted by efforts such as the Materials Genome Initiative(10)) provide an alternative to experimental trial-and-error. Three major strategies are: simulation-based predictions of physical properties (for example, charge mobility(11), photovoltaic properties(12), gas adsorption capacity(13) or lithium-ion intercalation(14)) to identify promising target candidates for synthetic efforts(11,15); determination of the structure-property relationship from large bodies of experimental data(16,17), enabled by integration with high-throughput synthesis and measurement tools(18); and clustering on the basis of similar crystallographic structure (for example, zeolite structure classification(19,20) or gas adsorption properties(21)). Here we demonstrate an alternative approach that uses machine-learning algorithms trained on reaction data to predict reaction outcomes for the crystallization of templated vanadium selenites. We used information on 'dark' reactions-failed or unsuccessful hydrothermal syntheses-collected from archived laboratory notebooks from our laboratory, and added physicochemical property descriptions to the raw notebook information using cheminformatics techniques. We used the resulting data to train a machine-learning model to predict reaction success. When carrying out hydrothermal synthesis experiments using previously untested, commercially available organic building blocks, our machine-learning model outperformed traditional human strategies, and successfully predicted conditions for new organically templated inorganic product formation with a success rate of 89 per cent. Inverting the machine-learning model reveals new hypotheses regarding the conditions for successful product formation.
引用
收藏
页码:73 / +
页数:5
相关论文
共 41 条
[1]   The Cambridge Structural Database: a quarter of a million crystal structures and rising [J].
Allen, FH .
ACTA CRYSTALLOGRAPHICA SECTION B-STRUCTURAL SCIENCE, 2002, 58 (3 PART 1) :380-388
[2]  
[Anonymous], 2013, JCHEM 6 1 3
[3]  
Barakat N., 2005, International Journal of Computational Intelligence, V2, P59
[4]   A New Era for ab initio Molecular Crystal Lattice Energy Prediction [J].
Beran, Gregory J. O. .
ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2015, 54 (02) :396-398
[5]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[6]  
Cheetham AK, 1999, ANGEW CHEM INT EDIT, V38, P3268, DOI 10.1002/(SICI)1521-3773(19991115)38:22<3268::AID-ANIE3268>3.0.CO
[7]  
2-U
[8]   High-throughput computational screening of metal-organic frameworks [J].
Colon, Yamil J. ;
Snurr, Randall Q. .
CHEMICAL SOCIETY REVIEWS, 2014, 43 (16) :5735-5749
[9]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[10]   The hydrothermal synthesis of zeolites: History and development from the earliest days to the present time [J].
Cundy, CS ;
Cox, PA .
CHEMICAL REVIEWS, 2003, 103 (03) :663-701