A novel logic-based approach for quantitative toxicology prediction

被引:34
作者
Amini, Ata
Muggleton, Stephen H.
Lodhi, Huma
Sternberg, Michael J. E. [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Div Mol Biosci, Struct Bioinformat Grp, Ctr Bioinformat, London SW7 2AZ, England
[2] Univ London Imperial Coll Sci Technol & Med, Dept Comp, Computat Bioinformat Lab, London SW7 2AZ, England
基金
英国生物技术与生命科学研究理事会;
关键词
D O I
10.1021/ci600223d
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
There is a pressing need for accurate in silico methods to predict the toxicity of molecules that are being introduced into the environment or are being developed into new pharmaceuticals. Predictive toxicology is in the realm of structure activity relationships ( SAR), and many approaches have been used to derive such SAR. Previous work has shown that inductive logic programming (ILP) is a powerful approach that circumvents several major difficulties, such as molecular superposition, faced by some other SAR methods. The ILP approach reasons with chemical substructures within a relational framework and yields chemically understandable rules. Here, we report a general new approach, support vector inductive logic programming (SVILP), which extends the essentially qualitative ILP-based SAR to quantitative modeling. First, ILP is used to learn rules, the predictions of which are then used within a novel kernel to derive a support-vector generalization model. For a highly heterogeneous dataset of 576 molecules with known fathead minnow fish toxicity, the cross-validated correlation coefficients (R-CV(2)) from a chemical descriptor method ( CHEM) and SVILP are 0.52 and 0.66, respectively. The ILP, CHEM, and SVILP approaches correctly predict 55, 58, and 73%, respectively, of toxic molecules. In a set of 165 unseen molecules, the R-2 values from the commercial software TOPKAT and SVILP are 0.26 and 0.57, respectively. In all calculations, SVILP showed significant improvements in comparison with the other methods. The SVILP approach has a major advantage in that it uses ILP automatically and consistently to derive rules, mostly novel, describing fragments that are toxicity alerts. The SVILP is a general machine-learning approach and has the potential of tackling many problems relevant to chemoinformatics including in silico drug design.
引用
收藏
页码:998 / 1006
页数:9
相关论文
共 46 条
[11]   PARTIAL LEAST-SQUARES REGRESSION - A TUTORIAL [J].
GELADI, P ;
KOWALSKI, BR .
ANALYTICA CHIMICA ACTA, 1986, 185 :1-17
[12]   ATOMIC PHYSICOCHEMICAL PARAMETERS FOR 3-DIMENSIONAL STRUCTURE DIRECTED QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS .3. MODELING HYDROPHOBIC INTERACTIONS [J].
GHOSE, AK ;
PRITCHETT, A ;
CRIPPEN, GM .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1988, 9 (01) :80-90
[13]  
Harris CA, 1997, ENVIRON HEALTH PERSP, V105, P802, DOI 10.2307/3433697
[14]  
Hathway D.E., 1984, MOL ASPECTS TOXICOLO
[15]   A survey of the Predictive Toxicology Challenge 2000-2001 [J].
Helma, C ;
Kramer, S .
BIOINFORMATICS, 2003, 19 (10) :1179-1182
[16]   Role of fluorescence line-narrowing spectroscopy and related luminescence-based techniques in the elucidation of mechanisms of tumor initiation by polycyclic aromatic hydrocarbons and estrogens [J].
Jankowiak, R ;
Rogan, EG ;
Cavalieri, EL .
JOURNAL OF PHYSICAL CHEMISTRY B, 2004, 108 (29) :10266-10283
[17]   Using probabilistic neural networks to model the toxicity of chemicals to the fathead minnow (Pimephales promelas):: A study based on 865 compounds [J].
Kaiser, KLE ;
Niculescu, SP .
CHEMOSPHERE, 1999, 38 (14) :3237-3245
[18]   QSPR - THE CORRELATION AND QUANTITATIVE PREDICTION OF CHEMICAL AND PHYSICAL-PROPERTIES FROM STRUCTURE [J].
KATRITZKY, AR ;
LOBANOV, VS ;
KARELSON, M .
CHEMICAL SOCIETY REVIEWS, 1995, 24 (04) :279-+
[19]  
KAUNYU L, 2004, TOXICOLOGY, V200, P113
[20]   Structure-activity relationships derived by machine learning: The use of atoms and their bond connectivities to predict mutagenicity by inductive logic programming [J].
King, RD ;
Muggleton, SH ;
Srinivasan, A ;
Sternberg, MJE .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (01) :438-442