ChemMine tools: an online service for analyzing and clustering small molecules

被引:357
作者
Backman, Tyler W. H. [1 ]
Cao, Yiqun [2 ]
Girke, Thomas [1 ]
机构
[1] Univ Calif Riverside, Dept Bot & Plant Sci, Riverside, CA 92521 USA
[2] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
基金
美国国家科学基金会;
关键词
PROTEIN-LIGAND COMPLEXES; ACCESSIBLE DATABASE; LIBRARY; DESCRIPTORS; AFFINITIES; RESOURCE; GENOMICS; DRUGS;
D O I
10.1093/nar/gkr320
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
ChemMine Tools is an online service for small molecule data analysis. It provides a web interface to a set of cheminformatics and data mining tools that are useful for various analysis routines performed in chemical genomics and drug discovery. The service also offers programmable access options via the R library ChemmineR. The primary functionalities of ChemMine Tools fall into five major application areas: data visualization, structure comparisons, similarity searching, compound clustering and prediction of chemical properties. First, users can upload compound data sets to the online Compound Workbench. Numerous utilities are provided for compound viewing, structure drawing and format interconversion. Second, pairwise structural similarities among compounds can be quantified. Third, interfaces to ultra-fast structure similarity search algorithms are available to efficiently mine the chemical space in the public domain. These include fingerprint and embedding/indexing algorithms. Fourth, the service includes a Clustering Toolbox that integrates cheminformatic algorithms with data mining utilities to enable systematic structure and activity based analyses of custom compound sets. Fifth, physicochemical property descriptors of custom compound sets can be calculated. These descriptors are important for assessing the bioactivity profile of compounds in silico and quantitative structure-activity relationship (QSAR) analyses. ChemMine Tools is available at:http://chemmine.ucr.edu
引用
收藏
页码:W486 / W491
页数:6
相关论文
共 45 条
  • [1] NIH Molecular Libraries Initiative
    Austin, CP
    Brady, LS
    Insel, TR
    Collins, FS
    [J]. SCIENCE, 2004, 306 (5699) : 1138 - 1139
  • [2] Berthold M.R., 2007, KNIME: The Konstanz Information Miner
  • [3] AffinDB: a freely accessible database of affinities for protein-ligand complexes from the PDB
    Block, Peter
    Sotriffer, Christoph A.
    Dramburg, Ingo
    Klebe, Gerhard
    [J]. NUCLEIC ACIDS RESEARCH, 2006, 34 : D522 - D526
  • [4] ChemmineR: a compound mining framework for R
    Cao, Yiqun
    Charisi, Anna
    Cheng, Li-Chang
    Jiang, Tao
    Girke, Thomas
    [J]. BIOINFORMATICS, 2008, 24 (15) : 1733 - 1734
  • [5] A maximum common substructure-based algorithm for searching and predicting drug-like compounds
    Cao, Yiqun
    Jiang, Tao
    Girke, Thomas
    [J]. BIOINFORMATICS, 2008, 24 (13) : I366 - I374
  • [6] Accelerated similarity searching and clustering of large compound sets by geometric embedding and locality sensitive hashing
    Cao, Yiqun
    Jiang, Tao
    Girke, Thomas
    [J]. BIOINFORMATICS, 2010, 26 (07) : 953 - 959
  • [7] ChemDB update - full-text search and virtual chemical space
    Chen, Jonathan H.
    Linstead, Erik
    Swamidass, S. Joshua
    Wang, Dennis
    Baldi, Pierre
    [J]. BIOINFORMATICS, 2007, 23 (17) : 2348 - 2351
  • [8] Performance of similarity measures in 2D fragment-based similarity searching: Comparison of structural descriptors and similarity coefficients
    Chen, X
    Reynolds, CH
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (06): : 1407 - 1414
  • [9] Molecular medicine - NIH dives into drug discovery
    Couzin, J
    [J]. SCIENCE, 2003, 302 (5643) : 218 - +
  • [10] ChEBI:: a database and ontology for chemical entities of biological interest
    Degtyarenko, Kirill
    de Matos, Paula
    Ennis, Marcus
    Hastings, Janna
    Zbinden, Martin
    McNaught, Alan
    Alcantara, Rafael
    Darsow, Michael
    Guedj, Mickael
    Ashburner, Michael
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : D344 - D350