ChemMine tools: an online service for analyzing and clustering small molecules

Cited by: 357

Authors:

Backman, Tyler W. H. [1]

Cao, Yiqun [2]

Girke, Thomas [1]

Affiliations:

[1] Univ Calif Riverside, Dept Bot & Plant Sci, Riverside, CA 92521 USA

[2] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA

Source:

NUCLEIC ACIDS RESEARCH | 2011 / Vol. 39

Funding:

U.S. National Science Foundation;

Keywords:

PROTEIN-LIGAND COMPLEXES; ACCESSIBLE DATABASE; LIBRARY; DESCRIPTORS; AFFINITIES; RESOURCE; GENOMICS; DRUGS;

DOI:

10.1093/nar/gkr320

Chinese Library Classification (CLC) number:

Q5 [Biochemistry]; Q7 [Molecular Biology];

Discipline classification code:

071010 ; 081704 ;

Abstract:

ChemMine Tools is an online service for small molecule data analysis. It provides a web interface to a set of cheminformatics and data mining tools that are useful for various analysis routines performed in chemical genomics and drug discovery. The service also offers programmable access options via the R library ChemmineR. The primary functionalities of ChemMine Tools fall into five major application areas: data visualization, structure comparisons, similarity searching, compound clustering and prediction of chemical properties. First, users can upload compound data sets to the online Compound Workbench. Numerous utilities are provided for compound viewing, structure drawing and format interconversion. Second, pairwise structural similarities among compounds can be quantified. Third, interfaces to ultra-fast structure similarity search algorithms are available to efficiently mine the chemical space in the public domain. These include fingerprint and embedding/indexing algorithms. Fourth, the service includes a Clustering Toolbox that integrates cheminformatic algorithms with data mining utilities to enable systematic structure- and activity-based analyses of custom compound sets. Fifth, physicochemical property descriptors of custom compound sets can be calculated. These descriptors are important for assessing the bioactivity profile of compounds in silico and for quantitative structure-activity relationship (QSAR) analyses. ChemMine Tools is available at: http://chemmine.ucr.edu
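
As a rough illustration of the pairwise similarity and clustering steps mentioned in the abstract, the sketch below computes a Tanimoto coefficient between set-based fingerprints and groups compounds by single linkage at a similarity cutoff. It is not the ChemMine Tools or ChemmineR implementation: the function names, the compound names, the fingerprint bit indices and the cutoff are all hypothetical, and the record does not state which similarity coefficient or clustering method the service uses.

```python
from itertools import combinations


def tanimoto(a: frozenset, b: frozenset) -> float:
    """Tanimoto coefficient between two sets of 'on' fingerprint bits."""
    if not a and not b:
        return 0.0  # convention chosen here for two empty fingerprints
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def single_linkage_clusters(fps: dict, cutoff: float) -> list:
    """Group compounds connected by chains of pairs with similarity >= cutoff."""
    parent = {name: name for name in fps}

    def find(x):
        # Follow parent links to the cluster representative (with path halving).
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for x, y in combinations(fps, 2):
        if tanimoto(fps[x], fps[y]) >= cutoff:
            parent[find(x)] = find(y)  # merge the two clusters

    groups = {}
    for name in fps:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(group) for group in groups.values())


# Toy data: hypothetical compounds with made-up fingerprint bit indices.
FINGERPRINTS = {
    "cmp_a": frozenset({1, 2, 3, 4}),
    "cmp_b": frozenset({1, 2, 3, 5}),
    "cmp_c": frozenset({10, 11, 12}),
    "cmp_d": frozenset({10, 11, 13}),
}

if __name__ == "__main__":
    for x, y in combinations(FINGERPRINTS, 2):
        print(x, y, round(tanimoto(FINGERPRINTS[x], FINGERPRINTS[y]), 2))
    print(single_linkage_clusters(FINGERPRINTS, cutoff=0.5))
```

Representing a fingerprint as a set of bit indices keeps the example dependency-free; real fingerprints would come from a cheminformatics toolkit, and a lower cutoff merges more compounds into each cluster.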

Pages: W486-W491

Page count: 6

