Development of CYP3A4 inhibition models: Comparisons of machine-learning techniques and molecular descriptors

被引:59
作者
Arimoto, R [1 ]
Prasad, MA [1 ]
Gifford, EM [1 ]
机构
[1] Pfizer Global Res & Dev, Ann Arbor, MI USA
关键词
CYP3A4; BFC; in silico screening; machine learning; structural fingerprint; similarity index; kappa;
D O I
10.1177/1087057104274091
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Computational models of cytochrome P450 3A4 inhibition were developed based on high-throughput screening data for 4470 proprietary compounds. Multiple models differentiating inhibitors (IC50 < 3 mu M) and noninhibitors were generated using various machine-learning algorithms (recursive partitioning [RP], Bayesian classifier, logistic regression, k-nearest-neighbor, and support vector machine [SVM]) with structural fingerprints and topological indices. Nineteen models were evaluated by internal 10-fold cross-validation and also by an independent test set. Three most predictive models, Barnard Chemical Information (BCI)-fingerprint/SVM, MDL-keyset/SVM, and topological indices/RP, correctly classified 249,248, and 236 compounds of 291 noninhibitors and 135,137, and 147 compounds of 179 inhibitors in the validation set. Their overall accuracies were 82%, 82%, and 81%, respectively. Investigating applicability of the BCI/SVM model found a strong correlation between the predictive performance and the structural similarity to the training set. Using Tanimoto similarity index as a confidence measurement for the predictions, the limitation of the extrapolation was 0.7 in the case of the BCI/SVM model. Taking consensus of the 3 best models yielded a further improvement in predictive capability, kappa = 0.65 and accuracy = 83%. The consensus model could also be tuned to minimize either false positives or false negatives depending on the emphasis of the screening. (Journal of Biomolecular Screening 2005:197-205).
引用
收藏
页码:197 / 205
页数:9
相关论文
共 54 条
[1]   CISAPRIDE AND TORSADES-DE-POINTES [J].
AHMAD, SR ;
WOLFE, SM .
LANCET, 1995, 345 (8948) :508-508
[2]  
Baune B, 1999, DRUG METAB DISPOS, V27, P565
[3]   Qualitative and quantitative assessment of drug-drug interaction potential in man, based on Ki, IC50 and inhibitor concentration [J].
Blanchard, N ;
Richert, L ;
Coassolo, P ;
Lavé, T .
CURRENT DRUG METABOLISM, 2004, 5 (02) :147-156
[4]  
Boxenbaum H, 1999, J PHARM PHARM SCI, V2, P47
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]   Using iterated bagging to debias regressions [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (03) :261-277
[8]  
Breiman L., 1998, CLASSIFICATION REGRE
[9]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[10]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46