Bayesian optimization for conformer generation

被引:41
作者
Chan, Lucian [1 ]
Hutchison, Geoffrey R. [2 ]
Morris, Garrett M. [1 ]
机构
[1] Univ Oxford, Dept Stat, 24-29 St Giles, Oxford OX1 3LB, England
[2] Univ Pittsburgh, Dept Chem & Chem Engn, 219 Parkman Ave, Pittsburgh, PA 15260 USA
来源
JOURNAL OF CHEMINFORMATICS | 2019年 / 11卷
基金
美国国家科学基金会; 英国工程与自然科学研究理事会;
关键词
Bayesian optimization; Gaussian processes; Conformer generation; Rotatable bond; Torsion angle; Conformational space; Molecular energetics; CONFORMATIONAL-ANALYSIS; VALIDATION;
D O I
10.1186/s13321-019-0354-7
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Generating low-energy molecular conformers is a key task for many areas of computational chemistry, molecular modeling and cheminformatics. Most current conformer generation methods primarily focus on generating geometrically diverse conformers rather than finding the most probable or energetically lowest minima. Here, we present a new stochastic search method called the Bayesian optimization algorithm (BOA) for finding the lowest energy conformation of a given molecule. We compare BOA with uniform random search, and systematic search as implemented in Confab, to determine which method finds the lowest energy. Energetic difference, root-mean-square deviation, and torsion fingerprint deviation are used to quantify the performance of the conformer search algorithms. In general, we find BOA requires far fewer evaluations than systematic or uniform random search to find low-energy minima. For molecules with four or more rotatable bonds, Confab typically evaluates 104(median) conformers in its search, while BOA only requires 102 energy evaluations to find top candidates. Despite using evaluating fewer conformers, 20-40% of the time BOA finds lower-energy conformations than a systematic Confab search for molecules with four or more rotatable bonds.
引用
收藏
页数:11
相关论文
共 45 条
[1]   The Cambridge Structural Database: a quarter of a million crystal structures and rising [J].
Allen, FH .
ACTA CRYSTALLOGRAPHICA SECTION B-STRUCTURAL SCIENCE, 2002, 58 (3 PART 1) :380-388
[2]  
[Anonymous], 2008, Chemistry Central Journal
[3]  
[Anonymous], 2018, Molecular Operating Environment (MOE)
[4]  
[Anonymous], ARXIV14024306V2STATM
[5]  
[Anonymous], 2012, P 25 INT C NEUR INF
[6]  
[Anonymous], 2011, RDKit: Open-source cheminformatics
[7]  
[Anonymous], GPYOPT BAYESIAN OPTI
[8]  
[Anonymous], 2014, AUTOMATIC MODEL CONS
[9]  
[Anonymous], SCI REPORTS
[10]  
[Anonymous], ARXIV10122599 CORR