Computer-Assisted Retrosynthesis Based on Molecular Similarity

被引:222
作者
Coley, Connor W. [1 ]
Rogers, Luke [1 ]
Green, William H. [1 ]
Jensen, Klavs F. [1 ]
机构
[1] MIT, Dept Chem Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
AIDED SYNTHESIS DESIGN; ORGANIC-CHEMISTRY; REACTION PREDICTION; KNOWLEDGE-BASE; FINGERPRINT; METHODOLOGY; LANGUAGE; SYSTEM; TOOL;
D O I
10.1021/acscentsci.7b00355
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We demonstrate molecular similarity to be a surprisingly effective metric for proposing and ranking onestep retrosynthetic disconnections based on analogy to precedent reactions. The developed approach mimics the retrosynthetic strategy defined implicitly by a corpus of known reactions without the need to encode any chemical knowledge. Using 40 000 reactions from the patent literature as a knowledge base, the recorded reactants are among the top 10 proposed precursors in 74.1% of 5000 test reactions, providing strong quantitative support for our methodology. Extension of the one-step strategy to multistep pathway planning is demonstrated and discussed for two exemplary drug products.
引用
收藏
页码:1237 / 1245
页数:9
相关论文
共 62 条
[1]  
[Anonymous], ARXIV161209529CS
[2]  
[Anonymous], 1 CLASS SMARTS PATTE
[3]  
[Anonymous], CHEMOINFORMATICS COM
[4]  
[Anonymous], RDKit: Open-source cheminformatics
[5]  
[Anonymous], 1990, SOFTWARE DEV CHEM
[6]  
[Anonymous], 2017, ARXIV170401212
[7]   Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations? [J].
Bajusz, David ;
Racz, Anita ;
Heberger, Kroly .
JOURNAL OF CHEMINFORMATICS, 2015, 7
[8]   Lossless compression of chemical fingerprints using integer entropy codes improves storage and retrieval [J].
Baldi, Pierre ;
Benz, Ryan W. ;
Hirschberg, Daniel S. ;
Swamidass, S. Joshua .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (06) :2098-2109
[9]   When is Chemical Similarity Significant? The Statistical Distribution of Chemical Similarity Scores and Its Extreme Values [J].
Baldi, Pierre ;
Nasr, Ramzi .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (07) :1205-1222
[10]   Molecular similarity: a key technique in molecular informatics [J].
Bender, A ;
Glen, RC .
ORGANIC & BIOMOLECULAR CHEMISTRY, 2004, 2 (22) :3204-3218