Machine-learning-assisted materials discovery using failed experiments

被引:1263
作者
Raccuglia, Paul [1 ]
Elbert, Katherine C. [1 ]
Adler, Philip D. F. [1 ]
Falk, Casey [1 ]
Wenny, Malia B. [1 ]
Mollo, Aurelio [1 ]
Zeller, Matthias [2 ]
Friedler, Sorelle A. [1 ]
Schrier, Joshua [1 ]
Norquist, Alexander J. [1 ]
机构
[1] Haverford Coll, 370 Lancaster Ave, Haverford, PA 19041 USA
[2] Purdue Univ, Dept Chem, 560 Oval Dr, W Lafayette, IN 47907 USA
基金
美国国家科学基金会;
关键词
METAL-ORGANIC FRAMEWORKS; CRYSTAL-STRUCTURE; SELENITES; SOLIDS;
D O I
10.1038/nature17439
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Inorganic-organic hybrid materials(1-3) such as organically templated metal oxides(1), metal-organic frameworks (MOFs)(2) and organohalide perovskites(4) have been studied for decades, and hydrothermal and (non-aqueous) solvothermal syntheses have produced thousands of new materials that collectively contain nearly all the metals in the periodic table(5-9). Nevertheless, the formation of these compounds is not fully understood, and development of new compounds relies primarily on exploratory syntheses. Simulation-and data-driven approaches (promoted by efforts such as the Materials Genome Initiative(10)) provide an alternative to experimental trial-and-error. Three major strategies are: simulation-based predictions of physical properties (for example, charge mobility(11), photovoltaic properties(12), gas adsorption capacity(13) or lithium-ion intercalation(14)) to identify promising target candidates for synthetic efforts(11,15); determination of the structure-property relationship from large bodies of experimental data(16,17), enabled by integration with high-throughput synthesis and measurement tools(18); and clustering on the basis of similar crystallographic structure (for example, zeolite structure classification(19,20) or gas adsorption properties(21)). Here we demonstrate an alternative approach that uses machine-learning algorithms trained on reaction data to predict reaction outcomes for the crystallization of templated vanadium selenites. We used information on 'dark' reactions-failed or unsuccessful hydrothermal syntheses-collected from archived laboratory notebooks from our laboratory, and added physicochemical property descriptions to the raw notebook information using cheminformatics techniques. We used the resulting data to train a machine-learning model to predict reaction success. When carrying out hydrothermal synthesis experiments using previously untested, commercially available organic building blocks, our machine-learning model outperformed traditional human strategies, and successfully predicted conditions for new organically templated inorganic product formation with a success rate of 89 per cent. Inverting the machine-learning model reveals new hypotheses regarding the conditions for successful product formation.
引用
收藏
页码:73 / +
页数:5
相关论文
共 41 条
[11]   Microporous solids:: From organically templated inorganic skeletons to hybrid frameworks ... ecumenism in chemistry [J].
Férey, G .
CHEMISTRY OF MATERIALS, 2001, 13 (10) :3084-3098
[12]   OXYFLUORINATED MICROPOROUS COMPOUNDS ULM-N - CHEMICAL-PARAMETERS, STRUCTURES AND A PROPOSED MECHANISM FOR THEIR MOLECULAR TECTONICS [J].
FEREY, G .
JOURNAL OF FLUORINE CHEMISTRY, 1995, 72 (02) :187-193
[13]   Rapid and Accurate Machine Learning Recognition of High Performing Metal Organic Frameworks for CO2 Capture [J].
Fernandez, Michael ;
Boyd, Peter G. ;
Daff, Thomas D. ;
Aghaji, Mohammad Zein ;
Woo, Tom K. .
JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2014, 5 (17) :3056-3060
[14]   Data-Driven Review of Thermoelectric Materials: Performance and Resource Considerations [J].
Gaultois, Michael W. ;
Sparks, Taylor D. ;
Borg, Christopher K. H. ;
Seshadri, Ram ;
Bonificio, William D. ;
Clarke, David R. .
CHEMISTRY OF MATERIALS, 2013, 25 (15) :2911-2920
[15]   Sixth blind test of organic crystal-structure prediction methods [J].
Groom, Colin R. ;
Reilly, Anthony M. .
ACTA CRYSTALLOGRAPHICA SECTION B-STRUCTURAL SCIENCE CRYSTAL ENGINEERING AND MATERIALS, 2014, 70 :776-777
[16]   Lead candidates for high-performance organic photovoltaics from high-throughput quantum chemistry - the Harvard Clean Energy Project [J].
Hachmann, Johannes ;
Olivares-Amaya, Roberto ;
Jinich, Adrian ;
Appleton, Anthony L. ;
Blood-Forsythe, Martin A. ;
Seress, Laszlo R. ;
Roman-Salgado, Carolina ;
Trepte, Kai ;
Atahan-Evrenk, Sule ;
Er, Sueleyman ;
Shrestha, Supriya ;
Mondal, Rajib ;
Sokolov, Anatoliy ;
Bao, Zhenan ;
Aspuru-Guzik, Alan .
ENERGY & ENVIRONMENTAL SCIENCE, 2014, 7 (02) :698-704
[17]  
Hall M., 2009, SIGKDD EXPLORATIONS, V11, P10, DOI [DOI 10.1145/1656274.1656278, 10.1145/1656274.1656278]
[18]  
Hastie T., 2009, ELEMENTS STAT LEARNI
[19]   REDUCED MOLYBDENUM PHOSPHATES - OCTAHEDRAL TETRAHEDRAL FRAMEWORK SOLIDS WITH TUNNELS, CAGES, AND MICROPORES [J].
HAUSHALTER, RC ;
MUNDI, LA .
CHEMISTRY OF MATERIALS, 1992, 4 (01) :31-48
[20]   Finding Nature's Missing Ternary Oxide Compounds Using Machine Learning and Density Functional Theory [J].
Hautier, Geoffroy ;
Fischer, Christopher C. ;
Jain, Anubhav ;
Mueller, Tim ;
Ceder, Gerbrand .
CHEMISTRY OF MATERIALS, 2010, 22 (12) :3762-3767