Classification of Cytochrome P450 Activities Using Machine Learning Methods

被引:32
作者
Hammann, Felix [1 ]
Gutmann, Heike [1 ]
Baumann, Ulli [1 ]
Helma, Christoph [2 ]
Drewe, Juergen [1 ]
机构
[1] Univ Basel, Univ Basel Hosp, Dept Gastroenterol & Hepatol, CH-4031 Basel, Switzerland
[2] Univ Freiburg, Freiburg Ctr Data Anal & Modelling, Freiburg, Germany
关键词
QSAR; cytochrome P-450; machine learning; drug safety; drug design; support vector machine; artificial neural network; decision trees; k nearest neighbors; random forest; PREDICTION; INHIBITORS; ISOENZYMES; INDUCERS;
D O I
10.1021/mp900217x
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
The cytochrome P-450 (GYP) system plays an integral part in the metabolism of drugs and other xenobiotics. Knowledge of the structural features required for interaction with any of the different isoforms of the CYP system is therefore immensely valuable in early drug discovery. In this paper, we focus on three major isoforms (CYP 1 A2, CYP 2D6, and CYP 3A4) and present a data set of 335 structurally diverse drug compounds classified for their interaction (as substrate, inhibitor, or any interaction) with these isoforms. We also present machine learning models using a variety of commonly used methods (k-nearest neighbors, decision tree induction using the CHAID and CRT algorithms, random forests, artificial neural networks, and support vector machines using the radial basis function (RBF) and homogeneous polynomials as kernel functions). We discuss the physicochemical features relevant for each end point and compare it to similar studies. Many of these models perform exceptionally well, even with 10-fold cross-validation, yielding corrected classification rates of 81.7 to 91.9% for CYP 1A2, 89.2 to 92.9% for CYP 2D6, and 87.4 to 89.9% for CYP3A4. Our models help in understanding the structural requirements for CYP interactions and can serve as sensitive tools in virtual screenings and lead optimization for toxicological profiles in drug discovery.
引用
收藏
页码:1920 / 1926
页数:7
相关论文
共 40 条
[1]  
Aizerman M. A., 1964, Automation and Remote Control, V25, P821
[2]  
[Anonymous], DETECTION INTERACTIO
[3]  
[Anonymous], P 14 INT JT C ART IN
[4]  
[Anonymous], ARTIFICIAL INTELLIGE
[5]  
[Anonymous], 2005, MORGAN KAUFMANN SERI
[6]  
[Anonymous], 1993, C4.5: Programs for machine learning
[7]  
[Anonymous], 1990, M 196 1988 LOS ANG C
[8]  
[Anonymous], MACH LEARN
[9]  
[Anonymous], 1912, Variabilita e Mutuabilita. Contributo allo Studio delle Distribuzioni e delle Relazioni Statistiche
[10]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669