Evolving interpretable structure - Activity relationships. 1. Reduced graph queries

被引:13
作者
Birchall, Kristian [1 ]
Gillet, Valerie J. [1 ]
Harper, Gavin [2 ]
Pickett, Stephen D. [2 ]
机构
[1] Univ Sheffield, Dept Informat Studies, Sheffield S1 4DP, S Yorkshire, England
[2] GlaxoSmithKline, Med Res Ctr, Stevenage SG1 2NY, Herts, England
基金
英国生物技术与生命科学研究理事会;
关键词
D O I
10.1021/ci8000502
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
A new machine learning method is presented for extracting interpretable structure-activity relationships from screening data. The method is based on an evolutionary algorithm and reduced graphs and aims to evolve a reduced graph query (subgraph) that is present within the active compounds and absent from the inactives. The reduced graph representation enables heterogeneous Compounds, such as those found in high-throughput screening data, to be captured in a single representation with the resulting query encoding structure-activity information in a form that is readily interpretable by a chemist. The application of the method is illustrated using data sets extracted from the well-known MDDR data set and GSK in-house screening data. Queries are evolved that are consistent with the known SARs, and they are also shown to be robust when applied to independent sets that were not used in training.
引用
收藏
页码:1543 / 1557
页数:15
相关论文
共 36 条
[11]   A rapid computational filter for cytochrome P450 1A2 inhibition potential of compound libraries [J].
Chohan, KK ;
Paine, SW ;
Mistry, J ;
Barton, P ;
Davis, AM .
JOURNAL OF MEDICINAL CHEMISTRY, 2005, 48 (16) :5154-5161
[12]   OPTIMIZATION AND INVIVO EVALUATIONS OF A SERIES OF SMALL, POTENT, AND SPECIFIC RENIN INHIBITORS CONTAINING A NOVEL LEU-VAL REPLACEMENT [J].
DELLARIA, JF ;
MAKI, RG ;
BOPP, BA ;
COHEN, J ;
KLEINERT, HD ;
LULY, JR ;
MERITS, I ;
PLATTNER, JJ ;
STEIN, HH .
JOURNAL OF MEDICINAL CHEMISTRY, 1987, 30 (11) :2137-2144
[13]  
Ekins S, 1999, PHARMACOGENETICS, V9, P477
[14]   Representing clusters using a maximum common edge substructure algorithm applied to reduced graphs and molecular graphs [J].
Gardiner, Eleanor J. ;
Gillet, Valerie J. ;
Willett, Peter ;
Cosgrove, David A. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (02) :354-366
[15]   Similarity searching using reduced graphs [J].
Gillet, VJ ;
Willett, P ;
Bradshaw, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (02) :338-345
[16]   Enrichment of high-throughput screening data with increasing levels of noise using support vector machines, recursive partitioning, and Laplacian-modified naive Bayesian classifiers [J].
Glick, M ;
Jenkins, JL ;
Nettles, JH ;
Hitchings, H ;
Davies, JW .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (01) :193-200
[17]   Automatic molecular design using evolutionary techniques [J].
Globus, A ;
Lawton, J ;
Wipke, T .
NANOTECHNOLOGY, 1999, 10 (03) :290-299
[18]  
Gusfield D., 1997, ALGORITHMS STRINGS T
[19]   The reduced graph descriptor in virtual screening and data-driven clustering of high-throughput screening data [J].
Harper, G ;
Bravi, GS ;
Pickett, SD ;
Hussain, J ;
Green, DVS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (06) :2145-2156
[20]   Methods for mining HTS data [J].
Harper, Gavin ;
Pickett, Stephen D. .
DRUG DISCOVERY TODAY, 2006, 11 (15-16) :694-699