RetroTransformDB: A Dataset of Generic Transforms for Retrosynthetic Analysis

Cited by: 9
Authors
Avramova, Svetlana [1]
Kochev, Nikolay [1]
Angelov, Plamen [1]
Affiliations
[1] Univ Plovdiv P Hilendarski, Fac Chem, 24 Tsar Assen Str, Plovdiv 4000, Bulgaria
Keywords
transforms; retrosynthesis; SMIRKS
DOI
10.3390/data3020014
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Presently, software tools for retrosynthetic analysis are widely used by organic, medicinal, and computational chemists. Rule-based systems make extensive use of collections of retro-reactions (transforms). While many public datasets contain reactions in the synthetic direction (usually non-generic reactions), there are no publicly available databases of generic reactions in a computer-readable format that can be used for retrosynthetic analysis. Here we present RetroTransformDB, a dataset of transforms that we compiled and coded in SMIRKS line notation. The collection comprises more than 100 records, each including the reaction name, the SMIRKS linear notation, the functional group to be obtained, and the transform type classification. All SMIRKS transforms were tested syntactically, semantically, and from a chemical point of view on different software platforms. The overall dataset design and the retrosynthetic fitness were analyzed and curated by organic chemistry experts. The RetroTransformDB dataset may be used by open-source and commercial software packages, as well as by chemoinformatics tools.
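The four fields that the abstract lists for each record could be held in a small data structure. The Python sketch below is only an illustration under stated assumptions: the class name `TransformRecord`, the method `sides`, and every value in `example` are hypothetical and are not taken from RetroTransformDB or any of its tooling. It assumes a transform is written as a two-sided SMIRKS with the target pattern to the left of `>>` and the precursor pattern to the right.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TransformRecord:
    """One dataset entry with the four fields named in the abstract."""
    name: str              # reaction name
    smirks: str            # transform in SMIRKS line notation
    functional_group: str  # functional group to be obtained
    transform_type: str    # transform type classification

    def sides(self) -> tuple[str, str]:
        """Split the SMIRKS at '>>' into (left pattern, right pattern)."""
        left, sep, right = self.smirks.partition(">>")
        if not sep or not left or not right:
            raise ValueError(f"not a two-sided SMIRKS: {self.smirks!r}")
        return left, right


# Hypothetical record: all values are illustrative, not taken from the dataset.
example = TransformRecord(
    name="Ester disconnection (illustrative)",
    smirks="[C:1](=[O:2])[O:3][C:4]>>[C:1](=[O:2])[OH].[OH:3][C:4]",
    functional_group="ester",
    transform_type="disconnection",
)

target_pattern, precursor_pattern = example.sides()
print(target_pattern)
print(precursor_pattern)
```

The split is purely textual and does not check SMIRKS syntax or chemistry; a cheminformatics toolkit would be needed for the kind of validation the abstract describes.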
Pages: 6