Hit-directed nearest-neighbor searching

被引:46
作者
Shanmugasundaram, V [1 ]
Maggiora, GM [1 ]
Lajiness, MS [1 ]
机构
[1] Pharmacia Corp, Struct & Computat Chem, Kalamazoo, MI 49007 USA
关键词
D O I
10.1021/jm0493515
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
This work describes a practical strategy used at Pharmacia for identifying compounds for follow-up screening following an initial HTS campaign against targets where no 3-D structural information is available and preliminary SAR models do not exist. The approach explicitly takes into account different representations of chemistry space and identifies compounds for follow-up screening that are likely to provide the best overall coverage of the chemistry spaces considered. Specifically, the work employs hit-directed nearest-neighbor (HDNN) searching of compound databases based upon a set of "probe compounds" obtained as hits in the preliminary high-throughput screens. Four different molecular representations that generate nearly unique chemistry spaces are used. The representations include 3-D, 2-D, 2-D topological BCUTs (2-DT) and molecular fingerprints derived from substructural fragments. In the case of the BCUT representations the NN searching is distance based, while in the case of molecular fingerprints a similarity-based measure is used. Generally, the results obtained differ significantly among all four methods, that is, the sets of NN compounds have surprisingly little overlap. Moreover, in all of the four chemistry space representations, a minimum of 3- to 4-fold enrichment in actives over random screening is observed even though the actives identified in each of the sets of NNs are in large measure unique. These results suggest that use of multiple searches based upon a variety of molecular representations provides an effective way of identifying more hits in HDNN searches of chemistry spaces than can be realized with single searches.
引用
收藏
页码:240 / 248
页数:9
相关论文
共 37 条
[1]  
[Anonymous], MOL SIMILARITY DRUG
[2]   Integration of virtual and high-throughput screening [J].
Bajorath, F .
NATURE REVIEWS DRUG DISCOVERY, 2002, 1 (11) :882-894
[3]   The design of combinatorial libraries using properties and 3D pharmacophore fingerprints [J].
Beno, BR ;
Mason, JS .
DRUG DISCOVERY TODAY, 2001, 6 (05) :251-258
[4]   MOLECULAR-IDENTIFICATION NUMBER FOR SUBSTRUCTURE SEARCHES [J].
BURDEN, FR .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1989, 29 (03) :225-227
[5]  
Engels MFM, 2000, J CHEM INF COMP SCI, V40, P241, DOI 10.1021/ci990435
[6]  
Friedman J., 2001, The elements of statistical learning, V1, DOI DOI 10.1007/978-0-387-21606-5
[7]   Application of BCUT metrics and genetic algorithm in binary QSAR analysis [J].
Gao, H .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (02) :402-407
[8]   Combination of molecular similarity measures using data fusion [J].
Ginn, CMR ;
Willett, P ;
Bradshaw, J .
PERSPECTIVES IN DRUG DISCOVERY AND DESIGN, 2000, 20 (01) :1-16
[9]  
Guner OsmanF., 2000, PHARMACOPHORE PERCEP
[10]  
HAGADONE T. R., 1993, P 2 INT CHEM STRUCT, P257