Prediction of human cytochrome P450 inhibition using support vector machines

Cited by: 34
Authors
Kriegl, JM [1]
Arnhold, T
Beck, B
Fox, T
Affiliations
[1] Boehringer Ingelheim Pharma GmbH & Co KG, Computat Chem, Dept Lead Discovery, D-88397 Biberach, Germany
[2] Boehringer Ingelheim Pharma GmbH & Co KG, DDS, DMPK, Dept Drug Discovery Support, D-88397 Biberach, Germany
Source
QSAR & COMBINATORIAL SCIENCE | 2005 / Vol. 24 / Issue 04
Keywords
in silico-ADMET; cytochrome P450; CYP3A4; CYP2D6; QSAR; molecular descriptors; support vector machines; PLS
DOI
10.1002/qsar.200430925
Chinese Library Classification number
R914 [Medicinal Chemistry]
Discipline classification code
100701
Abstract
Cytochrome P450s (CYPs) play a major role in the metabolism of drugs utilized in human health care. Inhibition of these enzymes by a drug may result in unwanted drug-drug interactions when two or more drugs are coadministered. Therefore, CYP inhibition should be investigated as early as possible in lead discovery and lead optimization. In silico approaches are highly desirable to assess large data sets or virtual compounds. Here we present the application of support vector machines (SVMs) to predict the potency of structurally diverse drug-like molecules to inhibit the human cytochromes P450 3A4 (CYP3A4) and 2D6 (CYP2D6). Different descriptor sets were used to cover various aspects of molecular properties, including physico-chemical properties derived from the 2D structure, the interactions of the molecule with its environment, and properties derived from quantum-mechanical calculations. Support vector classifiers were trained to distinguish between strong, medium, and weak inhibitors. For both isoenzymes, independent test set compounds were correctly re-classified with an accuracy of approximately 70%. The data sets were also used to generate support vector regression models. The best models were able to predict the log IC50 values of the test set compounds with a squared correlation coefficient of R² = 0.67 (CYP3A4, corresponding RMSE of 0.36 log units) and R² = 0.66 (CYP2D6, corresponding RMSE of 0.44 log units). Our results show that SVMs are a very powerful tool to predict CYP inhibition liability from calculated physico-chemical properties without invoking any information about the active site of the enzyme. The models can, for instance, be utilized to flag problematic compounds in an early step or to guide further synthesis efforts in a later stage of a research project.
Pages: 491-502
Page count: 12