Evolving interpretable structure - Activity relationships. 1. Reduced graph queries

被引:13
作者
Birchall, Kristian [1 ]
Gillet, Valerie J. [1 ]
Harper, Gavin [2 ]
Pickett, Stephen D. [2 ]
机构
[1] Univ Sheffield, Dept Informat Studies, Sheffield S1 4DP, S Yorkshire, England
[2] GlaxoSmithKline, Med Res Ctr, Stevenage SG1 2NY, Herts, England
基金
英国生物技术与生命科学研究理事会;
关键词
D O I
10.1021/ci8000502
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
A new machine learning method is presented for extracting interpretable structure-activity relationships from screening data. The method is based on an evolutionary algorithm and reduced graphs and aims to evolve a reduced graph query (subgraph) that is present within the active compounds and absent from the inactives. The reduced graph representation enables heterogeneous Compounds, such as those found in high-throughput screening data, to be captured in a single representation with the resulting query encoding structure-activity information in a form that is readily interpretable by a chemist. The application of the method is illustrated using data sets extracted from the well-known MDDR data set and GSK in-house screening data. Queries are evolved that are consistent with the known SARs, and they are also shown to be robust when applied to independent sets that were not used in training.
引用
收藏
页码:1543 / 1557
页数:15
相关论文
共 36 条
[1]   A model for identifying HERG K+ channel blockers [J].
Aronov, AM ;
Goldman, BB .
BIOORGANIC & MEDICINAL CHEMISTRY, 2004, 12 (09) :2307-2315
[2]   Scaffold hopping using clique detection applied to reduced graphs [J].
Barker, EJ ;
Buttar, D ;
Cosgrove, DA ;
Gardiner, EJ ;
Kitts, P ;
Willett, P ;
Gillet, VJ .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (02) :503-511
[3]   The properties of known drugs .1. Molecular frameworks [J].
Bemis, GW ;
Murcko, MA .
JOURNAL OF MEDICINAL CHEMISTRY, 1996, 39 (15) :2887-2893
[4]   Training similarity measures for specific activities: Application to reduced graphs [J].
Birchall, K ;
Gillet, VJ ;
Harper, G ;
Pickett, SD .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (02) :577-586
[5]   Evolving interpretable structure - Activity relationship models. 2. Using multiobjective optimization to derive multiple models [J].
Birchall, Kristian ;
Gillet, Valerie J. ;
Harper, Gavin ;
Pickentt, Stephen D. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (08) :1558-1570
[6]   The de novo design of median molecules within a property range of interest [J].
Brown, N ;
McKay, B ;
Gasteiger, J .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2004, 18 (12) :761-771
[7]   Contemporary QSAR classifiers compared [J].
Bruce, Craig L. ;
Melville, James L. ;
Pickett, Stephen D. ;
Hirst, Jonathan D. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (01) :219-227
[8]   Support vector inductive logic programming outperforms the naive Bayes classifier and inductive logic programming for the classification of bioactive chemical compounds [J].
Cannon, Edward O. ;
Amini, Ata ;
Bender, Andreas ;
Sternberg, Michael J. E. ;
Muggleton, Stephen H. ;
Glen, Robert C. ;
Mitchell, John B. O. .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2007, 21 (05) :269-280
[9]   Virtual screening using binary kernel discrimination: Effect of noisy training data and the optimization of performance [J].
Chen, BN ;
Harrison, RF ;
Pasupa, K ;
Willett, P ;
Wilton, DJ ;
Wood, DJ ;
Lewell, XQ .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (02) :478-486
[10]   Ligand-5-HT1A receptor interaction [J].
Chilmonczyk, Z .
FARMACO, 2000, 55 (03) :191-193