Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach

Cited by: 644
Authors
Ramakrishnan, Raghunathan [1, 2]
Dral, Pavlo O. [3, 4, 5]
Rupp, Matthias [1, 2]
von Lilienfeld, O. Anatole [1, 2, 6]
Affiliations
[1] Univ Basel, Inst Phys Chem, CH-4056 Basel, Switzerland
[2] Univ Basel, Natl Ctr Computat Design & Discovery Novel Mat, Dept Chem, CH-4056 Basel, Switzerland
[3] Max Planck Inst Kohlenforsch, D-45470 Mulheim, Germany
[4] Univ Erlangen Nurnberg, Comp Chem Ctr, D-91052 Erlangen, Germany
[5] Univ Erlangen Nurnberg, Dept Chem & Pharm, Interdisciplinary Ctr Mol Mat, D-91052 Erlangen, Germany
[6] Argonne Natl Lab, Argonne Leadership Comp Facil, Lemont, IL 60439 USA
Funding
Swiss National Science Foundation
Keywords
DESIGN; METHODOLOGY; MOLECULES
DOI
10.1021/acs.jctc.5b00099
Chinese Library Classification
O64 [Physical chemistry (theoretical chemistry), chemical physics]
Subject classification codes
070304; 081704
Abstract
Chemically accurate and comprehensive studies of the virtual space of all possible molecules are severely limited by the computational cost of quantum chemistry. We introduce a composite strategy that adds machine learning corrections to computationally inexpensive approximate legacy quantum methods. After training, highly accurate predictions of enthalpies, free energies, entropies, and electron correlation energies are possible, for significantly larger molecular sets than used for training. For thermochemical properties of up to 16k isomers of C7H10O2 we present numerical evidence that chemical accuracy can be reached. We also predict electron correlation energy in post-Hartree-Fock methods, at the computational cost of Hartree-Fock, and we establish a qualitative relationship between molecular entropy and electron correlation. The transferability of our approach is demonstrated, using semiempirical quantum chemistry and machine learning models trained on 1% and 10% of 134k organic molecules, to reproduce enthalpies of all remaining molecules at density functional theory level of accuracy.
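The composite strategy in the abstract can be read as: target property ≈ cheap baseline value + a machine-learned correction trained on the difference between the two. The following is a minimal sketch of that idea, not the authors' implementation. Everything in it is an assumption made for illustration: the one-dimensional "descriptor" x, the toy `baseline` and `target` functions standing in for an inexpensive and an expensive method, the Gaussian kernel, and the hyperparameters `sigma` and `lam`. Kernel ridge regression is used here as a convenient learner; the data are synthetic.

```python
import math

def baseline(x):
    # Hypothetical cheap method: gets the overall trend right.
    return x * x

def target(x):
    # Hypothetical expensive reference: baseline plus a smooth correction.
    return x * x + 0.5 * math.sin(3.0 * x)

def gaussian_kernel(a, b, sigma=0.5):
    # Similarity between two descriptors (sigma is an assumed length scale).
    return math.exp(-((a - b) ** 2) / (2.0 * sigma ** 2))

def solve(A, b):
    # Solve A x = b by Gaussian elimination with partial pivoting.
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def fit_delta(xs_train, lam=1e-6):
    # Train on the DIFFERENCE target - baseline, not on the target itself.
    deltas = [target(x) - baseline(x) for x in xs_train]
    K = [[gaussian_kernel(a, b) + (lam if i == j else 0.0)
          for j, b in enumerate(xs_train)]
         for i, a in enumerate(xs_train)]
    return solve(K, deltas)

def predict(x, xs_train, alphas):
    # Composite prediction: cheap baseline plus learned correction.
    correction = sum(a * gaussian_kernel(x, xt)
                     for a, xt in zip(alphas, xs_train))
    return baseline(x) + correction

# Small training set, larger evaluation set (both synthetic).
xs_train = [-2.0 + 4.0 * i / 19 for i in range(20)]
xs_test = [-2.0 + 4.0 * i / 199 for i in range(200)]
alphas = fit_delta(xs_train)

mae_base = sum(abs(target(x) - baseline(x)) for x in xs_test) / len(xs_test)
mae_delta = sum(abs(target(x) - predict(x, xs_train, alphas))
                for x in xs_test) / len(xs_test)
print("baseline MAE:", mae_base)
print("baseline + ML correction MAE:", mae_delta)
```

In this toy setting the expensive `target` is evaluated only on the 20 training points, while predictions for the 200 test points need just the cheap `baseline` plus a kernel sum, which mirrors the cost argument made in the abstract.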
Pages: 2087-2096
Page count: 10