RetroTransformDB: A Dataset of Generic Transforms for Retrosynthetic Analysis

被引:9
作者
Avramova, Svetlana [1 ]
Kochev, Nikolay [1 ]
Angelov, Plamen [1 ]
机构
[1] Univ Plovidv P Hilendarski, Fac Chem, 24 Tsar Assen Str, Plovdiv 4000, Bulgaria
关键词
transforms; retrosynthesis; SMIRKS;
D O I
10.3390/data3020014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Presently, software tools for retrosynthetic analysis are widely used by organic, medicinal, and computational chemists. Rule-based systems extensively use collections of retro-reactions (transforms). While there are many public datasets with reactions in synthetic direction (usually non-generic reactions), there are no publicly-available databases with generic reactions in computer-readable format which can be used for the purposes of retrosynthetic analysis. Here we present RetroTransformDB-a dataset of transforms, compiled and coded in SMIRKS line notation by us. The collection is comprised of more than 100 records, with each one including the reaction name, SMIRKS linear notation, the functional group to be obtained, and the transform type classification. All SMIRKS transforms were tested syntactically, semantically, and from a chemical point of view in different software platforms. The overall dataset design and the retrosynthetic fitness were analyzed and curated by organic chemistry experts. The RetroTransformDB dataset may be used by open-source and commercial software packages, as well as chemoinformatics tools.
引用
收藏
页数:6
相关论文
共 25 条
  • [1] Angelo J.D., 2015, HYBRID RETROSYNTHESI
  • [2] Artificial intelligence in synthetic chemistry: achievements and prospects
    Baskin, Igor I.
    Madzhidov, Timur I.
    Antipin, Igor S.
    Varnek, Alexandre A.
    [J]. RUSSIAN CHEMICAL REVIEWS, 2017, 86 (11) : 1127 - 1156
  • [3] No Electron Left Behind: A Rule-Based Expert System To Predict Chemical Reactions and Reaction Mechanisms
    Chen, Jonathan H.
    Baldi, Pierre
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2009, 49 (09) : 2034 - 2043
  • [4] Over 20 years of reaction access systems from MDL: A novel reaction substructure search algorithm
    Chen, LG
    Nourse, JG
    Christie, BD
    Leland, BA
    Grier, DL
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (06): : 1296 - 1310
  • [5] Automatic reaction mapping and reaction center detection
    Chen, William Lingran
    Chen, David Z.
    Taylor, Keith T.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2013, 3 (06) : 560 - 593
  • [6] Corey E. J., 1989, LOGIC CHEM SYNTHESIS
  • [7] DOGS: Reaction-Driven de novo Design of Bioactive Compounds
    Hartenfeller, Markus
    Zettl, Heiko
    Walter, Miriam
    Rupp, Matthias
    Reisen, Felix
    Proschak, Ewgenij
    Weggen, Sascha
    Stark, Holger
    Schneider, Gisbert
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (02)
  • [8] A Collection of Robust Organic Synthesis Reactions for In Silico Molecule Design
    Hartenfeller, Markus
    Eberle, Martin
    Meier, Peter
    Nieto-Oberhuber, Cristina
    Altmann, Karl-Heinz
    Schneider, Gisbert
    Jacoby, Edgar
    Renner, Steffen
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (12) : 3093 - 3098
  • [9] Hierarchical Analysis of Bioactive Matched Molecular Pairs, Encoded Chemical Transformations, and Associated Substructures
    Hu, Ye
    Bajorath, Juergen
    [J]. MOLECULAR INFORMATICS, 2016, 35 (10) : 483 - 488
  • [10] Chemical Transformations That Yield Compounds with Distinct Activity Profiles
    Hu, Ye
    Bajorath, Juergen
    [J]. ACS MEDICINAL CHEMISTRY LETTERS, 2011, 2 (07): : 523 - 527