A support vector machine approach to classify human cytochrome P450 3A4 inhibitors

被引:56
作者
Kriegl, JM [1 ]
Arnhold, T
Beck, B
Fox, T
机构
[1] Boehringer Ingelheim Pharma GmbH & Co KG, Dept Lead Discovery, D-88397 Biberach, Germany
[2] Boehringer Ingelheim Pharma GmbH & Co KG, Dept Drug Discovery Support, DDS DMPK, D-88397 Biberach, Germany
关键词
ADME; cytochrome P450; in silico filter; molecular descriptor; QSAR; support vector machine;
D O I
10.1007/s10822-005-3785-3
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The cytochrome P450 (CYP) enzyme superfamily plays a major role in the metabolism of commercially available drugs. Inhibition of these enzymes by a drug may result in a plasma level increase of another drug, thus leading to unwanted drug-drug interactions when two or more drugs are coadministered. Therefore, fast and reliable in silico methods predicting CYP inhibition from calculated molecular properties are an important tool which can be applied to assess both already synthesized as well as virtual compounds. We have studied the performance of support vector machines (SVMs) to classify compounds according to their potency to inhibit CYP3A4. The data set for model generation consists of more than 1300 structural diverse drug-like research molecules which were divided into training and test sets. The predictive power of SVMs crucially depends on a careful selection of parameters specifying the kernel function and the penalty for misclassifications. In this study we have investigated a procedure to identify a valid set of SVM parameters which is based on a sampling of the parameter space on a regular grid. From this set of parameters, either single SVMs or SVM committees were trained to distinguish between strong and weak inhibitors or to achieve a more realistic three-class assignment, with one class representing medium inhibitors. This workflow was studied for several kernel functions and descriptor sets. All SVM models performed significantly better than PLS-DA models which were generated from the corresponding descriptor sets. As a very promising result, simple two-dimensional (2D) descriptors yield a three-class model which correctly classifies more than 70% of the test set. Our work illustrates that SVMs used in combination with simple 2D descriptors provide a very effective and reliable tool which allows a fast assessment of CYP3A4 inhibition potency in an early in silico filtering process.
引用
收藏
页码:189 / 201
页数:13
相关论文
共 54 条
[1]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[2]  
BISHOP CM, 2004, BR J CLIN PHARM, V57, P473
[3]   SVM-based feature selection for characterization of focused compound collections [J].
Byvatov, E ;
Schneider, G .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (03) :993-999
[4]   Comparison of support vector machine and artificial neural network systems for drug/nondrug classification [J].
Byvatov, E ;
Fechner, U ;
Sadowski, J ;
Schneider, G .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (06) :1882-1889
[5]  
*CHEM COMP GROUP, 2003, MOL OP ENV REL 2003
[6]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[7]   VolSurf: a new tool for the pharmacokinetic optimization of lead compounds [J].
Cruciani, G ;
Pastor, M ;
Guba, W .
EUROPEAN JOURNAL OF PHARMACEUTICAL SCIENCES, 2000, 11 :S29-S39
[8]   The cytochrome P450 superfamily: Biochemistry, evolution and drug metabolism in humans [J].
Danielson, PB .
CURRENT DRUG METABOLISM, 2002, 3 (06) :561-597
[9]   THE DEVELOPMENT AND USE OF QUANTUM-MECHANICAL MOLECULAR-MODELS .76. AM1 - A NEW GENERAL-PURPOSE QUANTUM-MECHANICAL MOLECULAR-MODEL [J].
DEWAR, MJS ;
ZOEBISCH, EG ;
HEALY, EF ;
STEWART, JJP .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1985, 107 (13) :3902-3909
[10]   Generation and validation of rapid computational filters for CYP2D6 and CYP3A4 [J].
Ekins, S ;
Berbaum, J ;
Harrison, RK .
DRUG METABOLISM AND DISPOSITION, 2003, 31 (09) :1077-1080