A novel logic-based approach for quantitative toxicology prediction

被引:34
作者
Amini, Ata
Muggleton, Stephen H.
Lodhi, Huma
Sternberg, Michael J. E. [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Div Mol Biosci, Struct Bioinformat Grp, Ctr Bioinformat, London SW7 2AZ, England
[2] Univ London Imperial Coll Sci Technol & Med, Dept Comp, Computat Bioinformat Lab, London SW7 2AZ, England
基金
英国生物技术与生命科学研究理事会;
关键词
D O I
10.1021/ci600223d
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
There is a pressing need for accurate in silico methods to predict the toxicity of molecules that are being introduced into the environment or are being developed into new pharmaceuticals. Predictive toxicology is in the realm of structure activity relationships ( SAR), and many approaches have been used to derive such SAR. Previous work has shown that inductive logic programming (ILP) is a powerful approach that circumvents several major difficulties, such as molecular superposition, faced by some other SAR methods. The ILP approach reasons with chemical substructures within a relational framework and yields chemically understandable rules. Here, we report a general new approach, support vector inductive logic programming (SVILP), which extends the essentially qualitative ILP-based SAR to quantitative modeling. First, ILP is used to learn rules, the predictions of which are then used within a novel kernel to derive a support-vector generalization model. For a highly heterogeneous dataset of 576 molecules with known fathead minnow fish toxicity, the cross-validated correlation coefficients (R-CV(2)) from a chemical descriptor method ( CHEM) and SVILP are 0.52 and 0.66, respectively. The ILP, CHEM, and SVILP approaches correctly predict 55, 58, and 73%, respectively, of toxic molecules. In a set of 165 unseen molecules, the R-2 values from the commercial software TOPKAT and SVILP are 0.26 and 0.57, respectively. In all calculations, SVILP showed significant improvements in comparison with the other methods. The SVILP approach has a major advantage in that it uses ILP automatically and consistently to derive rules, mostly novel, describing fragments that are toxicity alerts. The SVILP is a general machine-learning approach and has the potential of tackling many problems relevant to chemoinformatics including in silico drug design.
引用
收藏
页码:998 / 1006
页数:9
相关论文
共 46 条
[1]  
[Anonymous], 1997, Machine Learning
[2]   Computational predictive programs (expert systems) in toxicology [J].
Benfenati, E ;
Gini, G .
TOXICOLOGY, 1997, 119 (03) :213-225
[3]   Computational studies on biphenyl derivatives.: Analysis of the conformational mobility, molecular electrostatic potential, and dipole moment of chlorinated biphenyl:: Searching for the rationalization of the selective toxicity of polychlorinated biphenyls (PCBs) [J].
Chana, A ;
Concejero, MA ;
de Frutos, M ;
González, MJ ;
Herradón, B .
CHEMICAL RESEARCH IN TOXICOLOGY, 2002, 15 (12) :1514-1526
[4]   Assessment and modeling of the toxicity of organic chemicals to Chlorella vulgaris:: Development of a novel database [J].
Cronin, MTD ;
Netzeva, TI ;
Dearden, JC ;
Edwards, R ;
Worgan, ADP .
CHEMICAL RESEARCH IN TOXICOLOGY, 2004, 17 (04) :545-554
[5]   Categorical regression of toxicity data: A case study using adicarb [J].
Dourson, ML ;
Teuschler, LK ;
Durkin, PR ;
Stiteler, WM .
REGULATORY TOXICOLOGY AND PHARMACOLOGY, 1997, 25 (02) :121-129
[6]   USE OF SAR IN COMPUTER-ASSISTED PREDICTION OF CARCINOGENICITY AND MUTAGENICITY OF CHEMICALS BY THE TOPKAT PROGRAM [J].
ENSLEIN, K ;
GOMBAR, VK ;
BLAKE, BW .
MUTATION RESEARCH, 1994, 305 (01) :47-61
[7]   Modes of action in ecotoxicology: Their role in body burdens, species sensitivity, QSARs, and mixture effects [J].
Escher, BI ;
Hermens, JLM .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2002, 36 (20) :4201-4217
[8]   Pharmacophore discovery using the Inductive Logic Programming system PROGOL [J].
Finn, P ;
Muggleton, S ;
Page, D ;
Srinivasan, A .
MACHINE LEARNING, 1998, 30 (2-3) :241-270
[9]   Structure-activity relationships for the antileishmanial and antitrypanosomal activities of 1'-substituted 9-anilinoacridines [J].
Gamage, SA ;
Figgitt, DP ;
Wojcik, SJ ;
Ralph, RK ;
Ransijn, A ;
Mauel, J ;
Yardley, V ;
Snowdon, D ;
Croft, SL ;
Denny, WA .
JOURNAL OF MEDICINAL CHEMISTRY, 1997, 40 (16) :2634-2642
[10]  
GARTNER T, 2000, P 19 INT C MACH LEAR, P79