Classification of biologically active compounds by median partitioning

被引:16
作者
Godden, JW
Xue, L
Bajorath, J
机构
[1] BRC, AMRI, Dept Comp Aided Drug Discovery, Bothell, WA 98011 USA
[2] Univ Washington, Dept Biol Struct, Seattle, WA 98195 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2002年 / 42卷 / 05期
关键词
D O I
10.1021/ci020372m
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The median partitioning (MP) method was originally developed for the selection of diverse subsets from compound databases. Following this approach, property descriptors are used id subsequent steps to divide compounds into defined partitions from which representative molecules are selected. For descriptor analysis, MP was coupled to a genetic algorithm. MP subset selection does not depend on pairwise comparison of molecules and is therefore applicable to very large compound pools. Here the MP approach was evaluated for the classification of molecules according to biological activity. A total of 317 molecules belonging to 21 different activity classes were studied. MP compound classification calculations were carried out both in the presence and absence of 2000 randomly selected "background" molecules. The performance of MP was compared to cell-based partitioning and found to be at least comparable, with up to approximately 82% of active molecules occurring in "pure" partitions consisting only of molecules sharing the same activity. Different from cell-based methods, MP classification is based on "direct" and "sequential" contributions of molecular property descriptors. Our results suggest that MP in not only an effective method for the selection of diverse subsets but also for the classification of active compounds and searching for molecules with desired activity.
引用
收藏
页码:1263 / 1269
页数:7
相关论文
共 27 条
[11]   Chemical descriptors with distinct levels of information content and varying sensitivity to differences between selected compound databases identified by SE-DSE analysis [J].
Godden, JW ;
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (01) :87-93
[12]   Median partitioning: A novel method for the selection of representative subsets from large compound pools [J].
Godden, JW ;
Xue, L ;
Kitchen, DB ;
Stahura, FL ;
Schermerhorn, EJ ;
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (04) :885-893
[13]  
Hall L. H., 1991, Reviews in Computational Chemistry, P367, DOI [10.1002/9780470125793.ch9, DOI 10.1002/9780470125793.CH9]
[14]   Latent semantic structure indexing (LaSSI) for defining chemical similarity [J].
Hull, RD ;
Singh, SB ;
Nachbar, RB ;
Sheridan, RP ;
Kearsley, SK ;
Fluder, EM .
JOURNAL OF MEDICINAL CHEMISTRY, 2001, 44 (08) :1177-1184
[15]   A widely applicable set of descriptors [J].
Labute, P .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2000, 18 (4-5) :464-477
[16]  
*MDL INF SYST INC, ACD AV CHEM DIR
[17]  
Meier P.C., 2000, STAT METHODS ANAL CH
[18]   Metric validation and the receptor-relevant subspace concept [J].
Pearlman, RS ;
Smith, KM .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (01) :28-35
[19]   Novel software tools for chemical diversity [J].
Pearlman, RS ;
Smith, KM .
PERSPECTIVES IN DRUG DISCOVERY AND DESIGN, 1998, 9-11 :339-353
[20]   APPLICATIONS OF THE RADIUS DIAMETER DIAGRAM TO THE CLASSIFICATION OF TOPOLOGICAL AND GEOMETRICAL SHAPES OF CHEMICAL-COMPOUNDS [J].
PETITJEAN, M .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1992, 32 (04) :331-337