Comparison of algorithms for dissimilarity-based compound selection

Cited by: 160
Authors
Snarey, M [1]
Terrett, NK
Willett, P
Wilton, DJ
Affiliations
[1] Pfizer Cent Res, Sandwich, Kent, England
[2] Univ Sheffield, Western Bank, Krebs Inst Biomolec Res, Sheffield, S Yorkshire, England
[3] Univ Sheffield, Western Bank, Dept Informat Studies, Sheffield, S Yorkshire, England
DOI
10.1016/S1093-3263(98)00008-4
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Dissimilarity-based compound selection has been suggested as an effective method for selecting structurally diverse subsets of chemical databases. This article reports a comparison of several maximum-dissimilarity and sphere-exclusion algorithms for dissimilarity-based selection. The effectiveness of the algorithms is quantified by the numbers of biological activity classes identified in subsets selected from the World Drugs Index database, and by the numbers of active compounds identified in feedback searches of this database. The experiments demonstrate the general effectiveness and efficiency of the MaxMin algorithm. © 1998 by Elsevier Science Inc.
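The abstract names the two algorithm families without describing them, so the following is only a minimal sketch of how each is commonly formulated, not the implementations compared in the article. It assumes compounds are represented as sets of fingerprint bit positions and that dissimilarity is 1 minus the Tanimoto coefficient. The function names, the `seed` and `threshold` parameters, and the choice of taking the next unexcluded compound in database order for sphere exclusion are all illustrative assumptions.

```python
from typing import FrozenSet, List, Sequence

# Assumed representation: a fingerprint is the set of its "on" bit positions.
Fingerprint = FrozenSet[int]


def tanimoto_dissimilarity(a: Fingerprint, b: Fingerprint) -> float:
    """Return 1 - Tanimoto coefficient; two empty fingerprints count as identical."""
    union = len(a | b)
    if union == 0:
        return 0.0
    return 1.0 - len(a & b) / union


def maxmin_select(fps: Sequence[Fingerprint], k: int, seed: int = 0) -> List[int]:
    """Select k indices; each new pick maximises its minimum dissimilarity
    to the compounds already selected (the MaxMin criterion)."""
    if not 0 < k <= len(fps):
        raise ValueError("k must be between 1 and len(fps)")
    selected = [seed]
    # min_dist[i] is the dissimilarity from compound i to its nearest selected compound.
    min_dist = [tanimoto_dissimilarity(fp, fps[seed]) for fp in fps]
    min_dist[seed] = -1.0  # negative value marks a compound as already selected
    while len(selected) < k:
        nxt = max(range(len(fps)), key=min_dist.__getitem__)
        selected.append(nxt)
        min_dist[nxt] = -1.0
        # Only the newly selected compound can lower the stored minima.
        for i, fp in enumerate(fps):
            if min_dist[i] >= 0.0:
                min_dist[i] = min(min_dist[i], tanimoto_dissimilarity(fp, fps[nxt]))
    return selected


def sphere_exclusion_select(fps: Sequence[Fingerprint], threshold: float) -> List[int]:
    """Select compounds in database order, skipping any compound whose
    dissimilarity to an earlier pick is at or below the threshold."""
    selected: List[int] = []
    for i, fp in enumerate(fps):
        if all(tanimoto_dissimilarity(fp, fps[j]) > threshold for j in selected):
            selected.append(i)
    return selected


if __name__ == "__main__":
    toy = [frozenset({1, 2, 3}), frozenset({1, 2, 4}),
           frozenset({7, 8, 9}), frozenset({1, 2, 3, 4})]
    print(maxmin_select(toy, 3))
    print(sphere_exclusion_select(toy, 0.6))
```

In this sketch MaxMin takes the subset size as input, whereas sphere exclusion takes a dissimilarity threshold and the subset size follows from it. Caching the nearest-selected dissimilarity keeps each MaxMin step linear in the database size.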
Pages: 372-385
Page count: 14
Related papers
30 in total
[1] On the use of information theory for assessing molecular diversity [J].
Agrafiotis, DK.
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (03): 576-580
[2] [Anonymous], MOL SIMILARITY DRUG
[3] BAWDEN D, 1990, CONCEPTS AND APPLICATIONS OF MOLECULAR SIMILARITY, P65
[4] BAWDEN D, 1993, CHEM STRUCTURES, V2, P383
[5] The information content of 2D and 3D structural descriptors relevant to ligand-receptor binding [J].
Brown, RD;
Martin, YC.
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (01): 1-9
[6] Use of structure-activity data to compare structure-based clustering methods and descriptors for use in compound selection [J].
Brown, RD;
Martin, YC.
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (03): 572-584
[7] Downs G. M., 1996, REV COMP CH, V7, P1
[8] Ferguson A. M., 1996, J BIOMOL SCREEN, V1, P65
[9] The effectiveness of reactant pools for generating structurally-diverse combinatorial libraries [J].
Gillet, VJ;
Willett, P;
Bradshaw, J.
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (04): 731-740
[10] A fast algorithm for selecting sets of dissimilar molecules from large chemical databases [J].
Holliday, JD;
Ranade, SS;
Willett, P.
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1995, 14 (06): 501-506