Partitioning methods for the identification of active molecules

被引:23
作者
Stahura, FL
Bajorath, J
机构
[1] Albany Mol Res Inc, Dept Comp Aided Drug Discovery, Bothell Res Ctr, Bothell, WA 98011 USA
[2] Univ Washington, Dept Biol Struct, Seattle, WA 98195 USA
关键词
molecular descriptors; chemical spaces; clustering and partitioning; compound classification; activity-based selection; database mining; virtual screening; drug discovery;
D O I
10.2174/0929867033457881
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The dramatically increasing number of compounds that become available for biological evaluation presents a significant challenge for database design, management, and mining. Computational approaches for screening, profiling, or filtering of large compound collections are by now widely used in pharmaceutical research. Among popular compound classification and database mining techniques, partitioning methods are computationally very efficient and particularly suitable for the analysis of increasingly large molecular databases, as they do not depend on pair-wise comparisons of compounds to assess molecular similarity or diversity. Promising applications of partitioning algorithms include diversity selection, searching for compounds with desired biological activity, or the derivation of predictive models from screening datasets. Compound partitioning is introduced here in the context of virtual screening and different partitioning methods are discussed that operate in low-dimensional or other chemical descriptor spaces, including a number of practical drug-discovery-related applications.
引用
收藏
页码:707 / 715
页数:9
相关论文
共 63 条
[41]   Clustering of large databases of compounds: Using the MDL ''keys'' as structural descriptors [J].
McGregor, MJ ;
Pallai, PV .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (03) :443-448
[42]   Rational screening set design and compound selection: Cascaded clustering [J].
Menard, PR ;
Lewis, RA ;
Mason, JS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (03) :497-505
[43]  
Miller DW, 2001, J CHEM INF COMP SCI, V41, P168, DOI [10.1021/ci0003348, 10.1021/ci0000348]
[44]   Analysis of large screening data sets via adaptively grown phylogenetic-like trees [J].
Nicolaou, CA ;
Tamura, SY ;
Kelley, BP ;
Bassett, SI ;
Nutt, RF .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (05) :1069-1079
[45]   Metric validation and the receptor-relevant subspace concept [J].
Pearlman, RS ;
Smith, KM .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (01) :28-35
[46]   Novel software tools for chemical diversity [J].
Pearlman, RS ;
Smith, KM .
PERSPECTIVES IN DRUG DISCOVERY AND DESIGN, 1998, 9-11 :339-353
[47]   Classification of kinase inhibitors using BCUT descriptors [J].
Pirard, B ;
Pickett, SD .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (06) :1431-1440
[48]  
POLISKI I, 2001, DRUG DISCOV DEV JUN, P40
[49]   Analysis of a large structure/biological activity data set using recursive partitioning [J].
Rusinko, A ;
Farmen, MW ;
Lambert, CG ;
Brown, PL ;
Young, SS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (06) :1017-1026
[50]   Design and diversity analysis of large combinatorial libraries using cell-based methods [J].
Schnur, D .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (01) :36-45