molSimplify: A toolkit for automating discovery in inorganic chemistry

被引:170
作者
Ioannidis, Efthymios I. [1 ]
Gani, Terry Z. H. [1 ]
Kulik, Heather J. [1 ]
机构
[1] MIT, Dept Chem Engn, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
chemical discovery; structure generation; first-principles simulation; high-throughput screening; !text type='python']python[!/text; MOLECULAR DESIGN; FORCE-FIELD; PLATFORM; PROGRAM; LIBRARY; CDK;
D O I
10.1002/jcc.24437
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We present an automated, open source toolkit for the first-principles screening and discovery of new inorganic molecules and intermolecular complexes. Challenges remain in the automatic generation of candidate inorganic molecule structures due to the high variability in coordination and bonding, which we overcome through a divide-and-conquer tactic that flexibly combines force-field preoptimization of organic fragments with alignment to first-principles-trained metal-ligand distances. Exploration of chemical space is enabled through random generation of ligands and intermolecular complexes from large chemical databases. We validate the generated structures with the root mean squared (RMS) gradients evaluated from density functional theory (DFT), which are around 0.02 Ha/au across a large 150 molecule test set. Comparison of molSimplify results to full optimization with the universal force field reveals that RMS DFT gradients are improved by 40%. Seamless generation of input files, preparation and execution of electronic structure calculations, and post-processing for each generated structure aids interpretation of underlying chemical and energetic trends. (c) 2016 Wiley Periodicals, Inc.
引用
收藏
页码:2106 / 2117
页数:12
相关论文
共 54 条
[1]  
[Anonymous], PYTH LANG REF
[2]   A QUANTUM-THEORY OF MOLECULAR-STRUCTURE AND ITS APPLICATIONS [J].
BADER, RFW .
CHEMICAL REVIEWS, 1991, 91 (05) :893-928
[3]   An object-oriented scripting interface to a legacy electronic structure code [J].
Bahn, SR ;
Jacobsen, KW .
COMPUTING IN SCIENCE & ENGINEERING, 2002, 4 (03) :56-66
[4]   A SIMPLE MEASURE OF ELECTRON LOCALIZATION IN ATOMIC AND MOLECULAR-SYSTEMS [J].
BECKE, AD ;
EDGECOMBE, KE .
JOURNAL OF CHEMICAL PHYSICS, 1990, 92 (09) :5397-5403
[5]   DENSITY-FUNCTIONAL THERMOCHEMISTRY .3. THE ROLE OF EXACT EXCHANGE [J].
BECKE, AD .
JOURNAL OF CHEMICAL PHYSICS, 1993, 98 (07) :5648-5652
[6]   KNIME-CDK: Workflow-driven cheminformatics [J].
Beisken, Stephan ;
Meinl, Thorsten ;
Wiswedel, Bernd ;
de Figueiredo, Luis F. ;
Berthold, Michael ;
Steinbeck, Christoph .
BMC BIOINFORMATICS, 2013, 14
[7]  
Bolton EE, 2010, ANN REP COMP CHEM, V4, P217, DOI 10.1016/S1574-1400(08)00012-1
[8]   In silico design in homogeneous catalysis using descriptor modelling [J].
Burello, Enrico ;
Rothenberg, Gadi .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2006, 7 (09) :375-404
[9]   ChemmineR: a compound mining framework for R [J].
Cao, Yiqun ;
Charisi, Anna ;
Cheng, Li-Chang ;
Jiang, Tao ;
Girke, Thomas .
BIOINFORMATICS, 2008, 24 (15) :1733-1734
[10]  
Chen X, 2001, COMB CHEM HIGH T SCR, V4, P719