3D-Scaffold: A Deep Learning Framework to Generate 3D Coordinates of Drug-like Molecules with Desired Scaffolds

被引:25
作者
Joshi, Rajendra P. [1 ]
Gebauer, Niklas W. A. [2 ,3 ,4 ]
Bontha, Mridula [1 ]
Khazaieli, Mercedeh [1 ]
James, Rhema M. [1 ]
Brown, James B. [5 ]
Kumar, Neeraj [1 ]
机构
[1] Pacific Northwest Natl Lab, Richland, WA 99352 USA
[2] Tech Univ Berlin, Machine Learning Grp, D-10587 Berlin, Germany
[3] Tech Univ Berlin, BASLEARN TU Berlin BASF Joint Lab Machine Learnin, D-10587 Berlin, Germany
[4] Berlin Inst Fdn Learning & Data, D-10587 Berlin, Germany
[5] Lawrence Berkeley Natl Lab, Environm Genom & Syst Biol, Berkeley, CA 94710 USA
关键词
DISCOVERY; DESIGN;
D O I
10.1021/acs.jpcb.1c06437
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
The prerequisite of therapeutic drug design and discovery is to identify novel molecules and developing lead candidates with desired biophysical and biochemical properties. Deep generative models have demonstrated their ability to find such molecules by exploring a huge chemical space efficiently. An effective way to generate new molecules with desired target properties is by constraining the critical fucntional groups or the core scaffolds in the generation process. To this end, we developed a domain aware generative framework called 3D-Scaffold that takes 3D coordinates of the desired scaffold as an input and generates 3D coordinates of novel therapeutic candidates as an output while always preserving the desired scaffolds in generated structures. We demonstrated that our framework generates predominantly valid, unique, novel, and experimentally synthesizable molecules that have drug-like properties similar to the molecules in the training set. Using domain specific data sets, we generate covalent and noncovalent antiviral inhibitors targeting viral proteins. To measure the success of our framework in generating therapeutic candidates, generated structures were subjected to high throughput virtual screening via docking simulations, which shows favorable interaction against SARS-CoV-2 main protease (Mpro) and nonstructural protein endoribonuclease (NSP15) targets. Most importantly, our deep learning model performs well with relatively small 3D structural training data and quickly learns to generalize to new scaffolds, highlighting its potential application to other domains for generating target specific candidates.
引用
收藏
页码:12166 / 12176
页数:11
相关论文
共 47 条
[1]  
[Anonymous], 2018, ARXIV2018171201664V4
[2]  
[Anonymous], 2017, ARXIV PREPRINT ARXIV
[3]   SMILES-based deep generative scaffold decorator for de-novo drug design [J].
Arus-Pous, Josep ;
Patronov, Atanas ;
Bjerrum, Esben Jannik ;
Tyrchan, Christian ;
Reymond, Jean-Louis ;
Chen, Hongming ;
Engkvist, Ola .
JOURNAL OF CHEMINFORMATICS, 2020, 12 (01)
[4]   Randomized SMILES strings improve the quality of molecular generative models [J].
Arus-Pous, Josep ;
Johansson, Simon Viet ;
Prykhodko, Oleksii ;
Bjerrum, Esben Jannik ;
Tyrchan, Christian ;
Reymond, Jean-Louis ;
Chen, Hongming ;
Engkvist, Ola .
JOURNAL OF CHEMINFORMATICS, 2019, 11 (01)
[5]   The properties of known drugs .1. Molecular frameworks [J].
Bemis, GW ;
Murcko, MA .
JOURNAL OF MEDICINAL CHEMISTRY, 1996, 39 (15) :2887-2893
[6]   An overview of drug discovery and development [J].
Berdigaliyev, Nurken ;
Aljofan, Mohamad .
FUTURE MEDICINAL CHEMISTRY, 2020, 12 (10) :939-947
[7]   Application of Generative Autoencoder in De Novo Molecular Design [J].
Blaschke, Thomas ;
Olivecrona, Marcus ;
Engkvist, Ola ;
Bajorath, Jurgen ;
Chen, Hongming .
MOLECULAR INFORMATICS, 2018, 37 (1-2)
[8]   Advancing Drug Discovery via Artificial Intelligence [J].
Chan, H. C. Stephen ;
Shan, Hanbin ;
Dahoun, Thamani ;
Vogel, Horst ;
Yuan, Shuguang .
TRENDS IN PHARMACOLOGICAL SCIENCES, 2019, 40 (08) :592-604
[9]   High-throughput screening of the ReFRAME, Pandemic Box, and COVID Box drug repurposing libraries against SARS-CoV-2 nsp15 endoribonuclease to identify small-molecule inhibitors of viral activity [J].
Choi, Ryan ;
Zhou, Mowei ;
Shek, Roger ;
Wilson, Jesse W. ;
Tillery, Logan ;
Craig, Justin K. ;
Salukhe, Indraneel A. ;
Hickson, Sarah E. ;
Kumar, Neeraj ;
James, Rhema M. ;
Buchko, Garry W. ;
Wu, Ruilian ;
Huff, Sydney ;
Nguyen, Tu-Trinh ;
Hurst, Brett L. ;
Cherry, Sara ;
Barrett, Lynn K. ;
Hyde, Jennifer L. ;
Van Voorhis, Wesley C. .
PLOS ONE, 2021, 16 (04)
[10]  
Clyde A., 2021, HIGH THROUGHPUT VIRT