Genetic programming for QSAR investigation of docking energy

Cited by: 13
Authors
Archetti, Francesco [1, 2]
Giordani, Ilaria [1, 3]
Vanneschi, Leonardo [1]
Affiliations
[1] Univ Milano Bicocca, Dipartimento Informat Sistemist & Comunicaz, I-20126 Milan, Italy
[2] Consorzio Milano Ric, I-20126 Milan, Italy
[3] DELOS Srl, I-20091 Milan, Italy
Keywords
Genetic Programming; Machine learning; Regression; Docking energy; Computational biology; Drug design; QSAR; ORAL BIOAVAILABILITY; FEATURE-SELECTION; PREDICTION; TOXICITY;
DOI
10.1016/j.asoc.2009.06.013
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Statistical methods, and Machine Learning in particular, are increasingly used in the drug development workflow to accelerate the discovery phase and to eliminate likely failures early in clinical development. In the past, the authors of this paper have worked specifically on two problems: (i) prediction of drug-induced toxicity and (ii) evaluation of the target-drug chemical interaction based on chemical descriptors. Among the numerous existing Machine Learning methods and their applications to drug development (see for instance [F. Yoshida, J.G. Topliss, QSAR model for drug human oral bioavailability, Journal of Medicinal Chemistry 43 (2000) 2575-2585; Frohlich, J. Wegner, F. Sieker, A. Zell, Kernel functions for attributed molecular graphs - a new similarity based approach to ADME prediction in classification and regression, QSAR and Combinatorial Science 38(4) (2003) 427-431; C.W. Andrews, L. Bennett, L.X. Yu, Predicting human oral bioavailability of a compound: development of a novel quantitative structure-bioavailability relationship, Pharmaceutical Research 17 (2000) 639-644; J. Feng, L. Lurati, H. Ouyang, T. Robinson, Y. Wang, S. Yuan, S.S. Young, Predictive toxicology: benchmarking molecular descriptors and statistical methods, Journal of Chemical Information and Computer Sciences 43 (2003) 1463-1470; T.M. Martin, D.M. Young, Prediction of the acute toxicity (96-h LC50) of organic compounds to the fathead minnow (Pimephales promelas) using a group contribution method, Chemical Research in Toxicology 14(10) (2001) 1378-1385; G. Colmenarejo, A. Alvarez-Pedraglio, J.L. Lavandera, Cheminformatic models to predict binding affinities to human serum albumin, Journal of Medicinal Chemistry 44 (2001) 4370-4378; J. Zupan, J. Gasteiger, Neural Networks in Chemistry and Drug Design: An Introduction, 2nd edition, Wiley, 1999]), we have been specifically concerned with Genetic Programming. A first paper [F. Archetti, E. Messina, S. Lanzeni, L. Vanneschi, Genetic programming for computational pharmacokinetics in drug discovery and development, Genetic Programming and Evolvable Machines 8(4) (2007) 17-26] was devoted to problem (i). The present contribution develops a Genetic Programming based framework on which to build specific strategies, which are then shown to be a valuable tool for problem (ii). In this paper, we use estrogen receptor molecules as targets and genistein-based compounds as drugs. Predicting their mutual interaction energy precisely and efficiently is an important task: for example, it may relate directly to the efficacy of genistein-based drugs in menopause therapy and in the natural prevention of some tumors. We compare the experimental results obtained by Genetic Programming with those of a set of "non-evolutionary" Machine Learning methods, including Support Vector Machines, Artificial Neural Networks, and Linear and Least Squares Regression. Experimental results confirm that Genetic Programming is a promising technique in terms of the accuracy of the proposed solutions, their generalization ability, and the correlation between predicted and correct data. © 2009 Elsevier B.V. All rights reserved.
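The abstract compares methods by accuracy and by the correlation between predicted and correct data, but does not name the exact measures. As a minimal sketch, assuming root mean squared error for accuracy and the Pearson coefficient for correlation (both are assumptions, as are the function names and the sample energies, which are hypothetical and not taken from the paper), such a comparison could be computed as follows:

```python
import math


def rmse(predicted, observed):
    """Root mean squared error between predicted and observed values."""
    n = len(observed)
    squared_errors = ((p - o) ** 2 for p, o in zip(predicted, observed))
    return math.sqrt(sum(squared_errors) / n)


def pearson(predicted, observed):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(observed)
    mean_p = sum(predicted) / n
    mean_o = sum(observed) / n
    cov = sum((p - mean_p) * (o - mean_o) for p, o in zip(predicted, observed))
    spread_p = math.sqrt(sum((p - mean_p) ** 2 for p in predicted))
    spread_o = math.sqrt(sum((o - mean_o) ** 2 for o in observed))
    return cov / (spread_p * spread_o)


# Hypothetical docking energies, for illustration only.
observed = [-7.2, -6.5, -8.1, -5.9]
predicted = [-7.0, -6.8, -7.9, -6.1]

print("RMSE:", rmse(predicted, observed))
print("Pearson r:", pearson(predicted, observed))
```

Computing both measures on a held-out test set, rather than on the training data, is what would speak to generalization ability.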
Pages: 170-182
Page count: 13
Related papers
47 entries in total
  • [1] *ACC INC, 2006, WORLD LEAD CHEM DRUG
  • [2] Akaike H., 1998, Selected papers of Hirotugu Akaike, P199, DOI 10.1007/978-1-4612-1694-0_15
  • [3] ALEX JS, 1998, TECHNICAL REPORT TEC
  • [4] Predicting human oral bioavailability of a compound: Development of a novel quantitative structure-bioavailability relationship
    Andrews, CW
    Bennett, L
    Yu, LX
    [J]. PHARMACEUTICAL RESEARCH, 2000, 17 (06) : 639 - 644
  • [5] [Anonymous], 1998, Genetic programming: an introduction: on the automatic evolution of computer programs and its applications
  • [6] Archetti F, 2007, GENETIC PROGRAMMING, V8, P17
  • [7] Archetti F, 2006, GECCO 2006: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOL 1 AND 2, P255
  • [8] BANLEY JM, 1993, PERSPECT DRUG DISCOV, V1, P301
  • [9] CHIAPPORI F, 2005, P BIOINF IT SOC BITS
  • [10] Cheminformatic models to predict binding affinities to human serum albumin
    Colmenarejo, G
    Alvarez-Pedraglio, A
    Lavandera, JL
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2001, 44 (25) : 4370 - 4378