Improving the odds in discriminating "Drug-like" from "Non Drug-like" compounds

被引:107
作者
Frimurer, TM
Bywater, R
Nærum, L
Lauritsen, LN
Brunak, S
机构
[1] Novo Nordisk AS, MedChem Res, DK-2760 Malov, Denmark
[2] Tech Univ Denmark, Dept Biotechnol, Ctr Biol Sequence Anal, DK-2800 Lyngby, Denmark
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2000年 / 40卷 / 06期
关键词
D O I
10.1021/ci0003810
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We have used a feed-forward neural network technique to classify chemical compounds into potentially "drug-like" and "non drug-like" candidates. The neural network was trained to distinguish between a set of "drug-like" and "non drug-like" chemical compounds taken from the MACCS-II Drug Data Report (MDDR) and the Available Chemicals Directory (ACD). The 2D atom types (of the full atomic representation) were assigned and applied as descriptors to encode numerically each compound. There are four main conclusions: First the method performs well, correctly assigning 88% of the compounds in both MDDR and ACD. Improved discrimination was achieved by a more critical selection of training sets. Second, the method gives much better prediction performance than the widely used "Rule of Five", which accepts as many as 74% of the ACD compounds but only 66% of those in MDDR, resulting in a correlation coefficient which is effectively zero, compared to a value of 0.63 for the neural network prediction. Third, based on a standard Tanimoto similarity search the selection of drug-like compounds in the evaluation set is not biased toward compounds similar to those in the training set. Fourth, the trained neural network was applied to evaluate the drug-likeness of 136 GABA uptake inhibitors with impressive results. The implications of applying a neural network to characterize chemical compounds are discussed.
引用
收藏
页码:1315 / 1324
页数:10
相关论文
共 23 条
[1]   Can we learn to distinguish between "drug-like" and "nondrug-like" molecules? [J].
Ajay ;
Walters, WP ;
Murcko, MA .
JOURNAL OF MEDICINAL CHEMISTRY, 1998, 41 (18) :3314-3324
[2]   APPLICATIONS OF NEURAL NETWORKS IN QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS OF DIHYDROFOLATE-REDUCTASE INHIBITORS [J].
ANDREA, TA ;
KALAYEH, H .
JOURNAL OF MEDICINAL CHEMISTRY, 1991, 34 (09) :2824-2836
[3]  
[Anonymous], 1996, MOL MODELLING PRINCI
[4]  
Baldi P., 1998, Bioinformatics: The machine learning approach
[5]   PREDICTION OF HUMAN MESSENGER-RNA DONOR AND ACCEPTOR SITES FROM THE DNA-SEQUENCE [J].
BRUNAK, S ;
ENGELBRECHT, J ;
KNUDSEN, S .
JOURNAL OF MOLECULAR BIOLOGY, 1991, 220 (01) :49-65
[6]   A pattern classification procedure integrating the multivariate statistical analysis with neural networks [J].
Chen, DZ ;
Chen, YQ ;
Hu, SX .
COMPUTERS & CHEMISTRY, 1997, 21 (02) :109-113
[7]  
DOUCET JP, 1996, MODELING COMPLEX DAT, P79
[8]   APPLICATIONS OF COMBINATORIAL TECHNOLOGIES TO DRUG DISCOVERY .1. BACKGROUND AND PEPTIDE COMBINATORIAL LIBRARIES [J].
GALLOP, MA ;
BARRETT, RW ;
DOWER, WJ ;
FODOR, SPA ;
GORDON, EM .
JOURNAL OF MEDICINAL CHEMISTRY, 1994, 37 (09) :1233-1251
[9]   Identification of biological activity profiles using substructural analysis and genetic algorithms [J].
Gillet, VJ ;
Willett, P ;
Bradshaw, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (02) :165-179
[10]   APPLICATIONS OF COMBINATORIAL TECHNOLOGIES TO DRUG DISCOVERY .2. COMBINATORIAL ORGANIC-SYNTHESIS, LIBRARY SCREENING STRATEGIES, AND FUTURE-DIRECTIONS [J].
GORDON, EM ;
BARRETT, RW ;
DOWER, WJ ;
FODOR, SPA ;
GALLOP, MA .
JOURNAL OF MEDICINAL CHEMISTRY, 1994, 37 (10) :1385-1401