Classification of Cytochrome P450 Inhibitors and Noninhibitors Using Combined Classifiers

被引:173
作者
Cheng, Feixiong [1 ]
Yu, Yue [1 ]
Shen, Jie [1 ]
Yang, Lei [3 ]
Li, Weihua [1 ]
Liu, Guixia [1 ]
Lee, Philip W. [1 ,2 ]
Tang, Yun [1 ]
机构
[1] E China Univ Sci & Technol, Sch Pharm, Dept Pharmaceut Sci, Shanghai 200237, Peoples R China
[2] Kyoto Univ, Grad Sch Agr, Sakyo Ku, Kyoto 6068502, Japan
[3] E China Univ Sci & Technol, Sch Informat Sci & Engn, Shanghai 200237, Peoples R China
基金
中国国家自然科学基金;
关键词
DRUG DISCOVERY; IN-VITRO; PREDICTION; QSAR; 2D; METABOLISM; DOMAIN; MODEL; 3A4;
D O I
10.1021/ci200028n
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Adverse side effects of drug drug interactions induced by human cytochrome P450 (CYP) inhibition is an important consideration, especially, during the research phase of drug discovery. It is highly desirable to develop computational models that can predict the inhibitive effect of a compound against a specific CYP isoform. In this study, inhibitor predicting models were developed for five major CYP isoforms, namely 1A2, 2C9, 2C19, 2D6, and 3A4, using a combined classifier algorithm on a large data set containing more than 24,700 unique compounds, extracted from PubChem. The combined classifiers algorithm is an ensemble of different independent machine learning classifiers including support vector machine, C4.5 decision tree, k-nearest neighbor, and naive Bayes, fused by a back-propagation artificial neural network (BP-ANN). All developed models were validated by 5-fold cross-validation and a diverse validation set composed of about 9000 diverse unique compounds. The range of the area under the receiver operating characteristic curve (AUC) for the validation sets was 0.764 to 0.815 for CYP1A2, 0.837 to 0.861 for CYP2C9, 0.793 to 0.842 for CYP2C19, 0.839 to 0.886 for CYP2D6, and 0.754 to 0.790 for CYP3A4, respectively, using the new developed combined classifiers. The overall performance of the combined classifiers fused by BP-ANN was superior to that of three classic fusion techniques (Mean, Maximum, and Multiply). The chemical spaces of data sets were explored by multidimensional scaling plots, and the use of applicability domain improved the prediction accuracies of models. In addition, some representative substructure fragments differentiating CYP inhibitors and noninhibitors were characterized by the substructure fragment analysis. These classification models are applicable for virtual screening of the five major CYP isoforms inhibitors or can be used as simple filters of potential chemicals in drug discovery.
引用
收藏
页码:996 / 1011
页数:16
相关论文
共 59 条
[1]  
Accelrys Software Inc, 2004, DISC STUD MOD ENV RE
[2]   On the use of neural network ensembles in QSAR and QSPR [J].
Agrafiotis, DK ;
Cedeño, W ;
Lobanov, VS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (04) :903-911
[3]  
[Anonymous], 2014, C4. 5: programs for machine learning
[4]  
[Anonymous], 2009, NEURAL NETWORKS LEAR
[5]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[6]   The conduct of in vitro and in vivo drug-drug interaction studies: A Pharmaceutical Research and Manufacturers of America (PhRMA) perspective [J].
Bjornsson, TD ;
Callaghan, JT ;
Einolf, HJ ;
Fischer, V ;
Gan, L ;
Grimm, S ;
Kao, J ;
King, SP ;
Miwa, G ;
Ni, L ;
Kumar, G ;
McLeod, J ;
Obach, RS ;
Roberts, S ;
Roe, A ;
Shah, A ;
Snikeris, F ;
Sullivan, JT ;
Tweedie, D ;
Vega, JM ;
Walsh, J ;
Wrighton, SA .
DRUG METABOLISM AND DISPOSITION, 2003, 31 (07) :815-832
[7]   Domain of EPI suite biotransformation models [J].
Boethling, R. S. ;
Costanza, J. .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2010, 21 (5-6) :415-443
[8]  
Chang C.-C., LIBSVM: a Library for Support Vector Machines
[9]   In silico prediction of Tetrahymena pyriformis toxicity for diverse industrial chemicals with substructure pattern recognition and machine learning methods [J].
Cheng, Feixiong ;
Shen, Jie ;
Yu, Yue ;
Li, Weihua ;
Liu, Guixia ;
Lee, Philip W. ;
Tang, Yun .
CHEMOSPHERE, 2011, 82 (11) :1636-1643
[10]  
Cortes C., 1995, Machine Learning, V297, P273, DOI [DOI 10.1007/BF00994018, DOI 10.1023/A:1022627411411]