Classification of biologically active compounds by median partitioning

被引:16
作者
Godden, JW
Xue, L
Bajorath, J
机构
[1] BRC, AMRI, Dept Comp Aided Drug Discovery, Bothell, WA 98011 USA
[2] Univ Washington, Dept Biol Struct, Seattle, WA 98195 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2002年 / 42卷 / 05期
关键词
D O I
10.1021/ci020372m
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The median partitioning (MP) method was originally developed for the selection of diverse subsets from compound databases. Following this approach, property descriptors are used id subsequent steps to divide compounds into defined partitions from which representative molecules are selected. For descriptor analysis, MP was coupled to a genetic algorithm. MP subset selection does not depend on pairwise comparison of molecules and is therefore applicable to very large compound pools. Here the MP approach was evaluated for the classification of molecules according to biological activity. A total of 317 molecules belonging to 21 different activity classes were studied. MP compound classification calculations were carried out both in the presence and absence of 2000 randomly selected "background" molecules. The performance of MP was compared to cell-based partitioning and found to be at least comparable, with up to approximately 82% of active molecules occurring in "pure" partitions consisting only of molecules sharing the same activity. Different from cell-based methods, MP classification is based on "direct" and "sequential" contributions of molecular property descriptors. Our results suggest that MP in not only an effective method for the selection of diverse subsets but also for the classification of active compounds and searching for molecules with desired activity.
引用
收藏
页码:1263 / 1269
页数:7
相关论文
共 27 条
[1]  
[Anonymous], MACCS KEYS
[2]  
[Anonymous], MOE MOL OP ENV VERS
[3]   Selected concepts and investigations in compound classification, molecular descriptor analysis, and virtual screening [J].
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (02) :233-245
[4]   Topological indices: Their nature and mutual relatedness [J].
Basak, SC ;
Balaban, AT ;
Grunwald, GD ;
Gute, BD .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (04) :891-898
[5]   Binning schemes for partition-based compound selection [J].
Bayley, MJ ;
Willett, P .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 1999, 17 (01) :10-18
[6]   Use of structure Activity data to compare structure-based clustering methods and descriptors for use in compound selection [J].
Brown, RD ;
Martin, YC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (03) :572-584
[7]   GENETIC ALGORITHMS - PRINCIPLES OF NATURAL-SELECTION APPLIED TO COMPUTATION [J].
FORREST, S .
SCIENCE, 1993, 261 (5123) :872-878
[8]  
FRIEDMAN JH, 1977, IEEE T COMPUT, V26, P404, DOI 10.1109/TC.1977.1674849
[9]   ITERATIVE PARTIAL EQUALIZATION OF ORBITAL ELECTRONEGATIVITY - A RAPID ACCESS TO ATOMIC CHARGES [J].
GASTEIGER, J ;
MARSILI, M .
TETRAHEDRON, 1980, 36 (22) :3219-3228
[10]   Variability of molecular descriptors in compound databases revealed by Shannon entropy calculations [J].
Godden, JW ;
Stahura, FL ;
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (03) :796-800