Balancing representativeness against diversity using optimizable K-dissimilarity and hierarchical clustering

被引:25
作者
Clark, RD [1 ]
Langton, WJ [1 ]
机构
[1] Tripos Inc, St Louis, MO 63144 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 1998年 / 38卷 / 06期
关键词
D O I
10.1021/ci980107u
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
When assessing the pharmacological potential of large libraries of compounds, it is often useful to start by determining the biochemical activities of some subset thereof. This is so whether the compounds in question have in fact already been synthesized or exist solely as virtual libraries. A suitable subset for this task must be structurally diverse, so as to minimize redundant testing, but must also be representative, so that valuable subgroups do not get overlooked. These two needs are intrinsically in conflict, with gains in one necessarily coming at the expense of the other. Results obtained using optimizable K-dissimilarity selection and clustering are described and compared with those obtained using more traditional agglomerative hierarchical clustering techniques.
引用
收藏
页码:1079 / 1086
页数:8
相关论文
共 19 条
[1]   CLUSTERING OF CHEMICAL STRUCTURES ON THE BASIS OF 2-DIMENSIONAL SIMILARITY MEASURES [J].
BARNARD, JM ;
DOWNS, GM .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1992, 32 (06) :644-649
[2]   The information content of 2D and 3D structural descriptors relevant to ligand-receptor binding [J].
Brown, RD ;
Martin, YC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (01) :1-9
[3]   Use of structure Activity data to compare structure-based clustering methods and descriptors for use in compound selection [J].
Brown, RD ;
Martin, YC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (03) :572-584
[4]   OptiSim: An extended dissimilarity selection method for finding diverse representative subsets [J].
Clark, RD .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (06) :1181-1188
[5]  
CLARK RD, 1998, 3D QSAR DRUG DESIGN, V2, P211
[6]   Bioisosterism as a molecular diversity descriptor: Steric fields of single ''topomeric'' conformers [J].
Cramer, RD ;
Clark, RD ;
Patterson, DE ;
Ferguson, AM .
JOURNAL OF MEDICINAL CHEMISTRY, 1996, 39 (16) :3060-3069
[7]  
Gower J. C., 1985, Encyclopedia of statistical sciences, VVol. 5, P397
[8]  
Holliday JD, 1996, SLAS DISCOV, V1, P145
[9]  
Kaufman L., 1990, FINDING GROUP DATA I, P230
[10]   A GENERAL THEORY OF CLASSIFICATORY SORTING STRATEGIES .1. HIERARCHICAL SYSTEMS [J].
LANCE, GN ;
WILLIAMS, WT .
COMPUTER JOURNAL, 1967, 9 (04) :373-&