Recursive median partitioning for virtual screening of large databases

被引:30
作者
Godden, JW
Furr, JR
Bajorath, J
机构
[1] Albany Mol Res Inc, Dept Comp Aided Drug Discovery, Albany, NY 12212 USA
[2] AMRI Bothell Res Ctr, Bothell, WA 98011 USA
[3] Univ Washington, Dept Biol Struct, Seattle, WA 98195 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2003年 / 43卷 / 01期
关键词
D O I
10.1021/ci0203848
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Recently, we have introduced the median partitioning (MP) method for diversity selection and compound classification. The MP approach utilizes property descriptors with continuous value ranges. transforms these descriptors into a binary classification scheme by determining their medians in source databases, and divides database molecules in subsequent steps into populations above or below these medians. Having previously demonstrated the usefulness of MP for the classification of molecules according to biological activity. we have now gone a step further and extended the methodology for application in virtual screening. In these calculations, a series of bait molecules having desired activity is added to large compound databases, and subsequent iterations or recursions are carried out to reduce the number of candidate molecules until a small number of compounds are found in partitions enriched with bait molecules. For each recursion step, descriptor combinations are identified that copartition as many active molecules as possible. Descriptor selection is facilitated by application of a genetic algorithm (GA). The recursive MP approach (RMP) has been applied to five diverse biological activity classes in virtual screening of a database consisting of approximately 1.34 million molecules to which different types of active compounds were added. RMP analysis produced hit rates of up to 21%, dependent on the biological activity class, and led to an average similar to3600-fold improvement over random selection for the activity classes that were used as test cases.
引用
收藏
页码:182 / 188
页数:7
相关论文
共 24 条
[1]   Nonlinear mapping networks [J].
Agrafiotis, DK ;
Lobanov, VS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (06) :1356-1362
[2]   Multidimensional scaling of combinatorial libraries without explicit enumeration [J].
Agrafiotis, DK ;
Lobanov, VS .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2001, 22 (14) :1712-1722
[3]   On combining recursive partitioning and simulated annealing to detect groups of biologically active compounds [J].
Blower, P ;
Fligner, M ;
Verducci, J ;
Bjoraker, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (02) :393-404
[4]  
*CHEM COMP GROUP I, MOE VERS 2001 01
[5]   Recursive partitioning analysis of a large structure-activity data set using three-dimensional descriptors [J].
Chen, X ;
Rusinko, A ;
Young, SS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (06) :1054-1062
[6]   Molecular docking and high-throughput screening for novel inhibitors of protein tyrosine phosphatase-1B [J].
Doman, TN ;
McGovern, SL ;
Witherbee, BJ ;
Kasten, TP ;
Kurumbail, R ;
Stallings, WC ;
Connolly, DT ;
Shoichet, BK .
JOURNAL OF MEDICINAL CHEMISTRY, 2002, 45 (11) :2213-2221
[7]  
Engels M F, 2001, Curr Opin Drug Discov Devel, V4, P275
[8]   Chemical descriptors with distinct levels of information content and varying sensitivity to differences between selected compound databases identified by SE-DSE analysis [J].
Godden, JW ;
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (01) :87-93
[9]   Median partitioning: A novel method for the selection of representative subsets from large compound pools [J].
Godden, JW ;
Xue, L ;
Kitchen, DB ;
Stahura, FL ;
Schermerhorn, EJ ;
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (04) :885-893
[10]   Classification of biologically active compounds by median partitioning [J].
Godden, JW ;
Xue, L ;
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (05) :1263-1269