ADME evaluation in drug discovery. 8. The prediction of human intestinal absorption by a support vector machine

被引:106
作者
Hou, Tingjun [1 ]
Wang, Junmei
Li, Youyong
机构
[1] Univ Calif San Diego, Ctr Theoret Biol Phys, Dept Chem & Biochem, La Jolla, CA 92093 USA
[2] Encys Pharmaceut Inc, Houston, TX 77030 USA
[3] CALTECH, Mat & proc Simulat Ctr, Pasadena, CA 91125 USA
关键词
D O I
10.1021/ci7002076
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Human intestinal absorption (HIA) is an important roadblock in the formulation of new drug substances. In silico models for predicting the percentage of HIA based on calculated molecular descriptors are highly needed for the rapid estimation of this property. Here, we have studied the performance of a support vector machine (SVM) to classify compounds with high or low fractional absorption (%FA > 30% or %FA :5 30%). The analyzed data set consists of 578 structural diverse druglike molecules, which have been divided into a 480-molecule training set and a 98-molecule test set. Ten SVM classification models have been generated to investigate the impact of different individual molecular properties on %FA. Among these studied important molecule descriptors, topological polar surface area (TPSA) and predicted apparent octanol - water distribution coefficient at pH 6.5 (logD(6.5)) show better classification performance than the others. To obtain the best SVM classifier, the influences of different kernel functions and different combinations of molecular descriptors were investigated using a rigorous training-validation procedure. The best SVM classifier can give satisfactory predictions for the training set (97.8% for the poor-absorption class and 94.5% for the good-absorption class). Moreover, 100% of the poor-absorption class and 97.8% of the good-absorption class in the external test set could be correctly classified. Finally, the influence of the size of the training set and the unbalanced nature of the data set have been studied. The analysis demonstrates that large data set is necessary for the stability of the classification models. Furthermore, the weights for the poor-absorption class and the good-absorption class should be properly balanced to generate unbiased classification models. Our work illustrates that SVMs used in combination with simple molecular descriptors can provide an extremely reliable assessment of intestinal absorption in an early in silico filtering process.
引用
收藏
页码:2408 / 2415
页数:8
相关论文
共 41 条
[1]  
*ACDLABS, V90 ACDLABS
[2]   The influence of nonspecific microsomal binding on apparent intrinsic clearance, and its prediction from physicochemical properties [J].
Austin, RP ;
Barton, P ;
Cockroft, SL ;
Wenlock, MC ;
Riley, RJ .
DRUG METABOLISM AND DISPOSITION, 2002, 30 (12) :1497-1503
[3]  
Beresford AP, 2004, CURR OPIN DRUG DISC, V7, P36
[4]   A high-throughput screening method for the determination of aqueous drug solubility using laser nephelometry in microtiter plates [J].
Bevan, CD ;
Lloyd, RS .
ANALYTICAL CHEMISTRY, 2000, 72 (08) :1781-1787
[5]   Knowledge-based analysis of microarray gene expression data by using support vector machines [J].
Brown, MPS ;
Grundy, WN ;
Lin, D ;
Cristianini, N ;
Sugnet, CW ;
Furey, TS ;
Ares, M ;
Haussler, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) :262-267
[6]   Drug design by machine learning: support vector machines for pharmaceutical data analysis [J].
Burbidge, R ;
Trotter, M ;
Buxton, B ;
Holden, S .
COMPUTERS & CHEMISTRY, 2001, 26 (01) :5-14
[7]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[8]   Comparison of support vector machine and artificial neural network systems for drug/nondrug classification [J].
Byvatov, E ;
Fechner, U ;
Sadowski, J ;
Schneider, G .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (06) :1882-1889
[9]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[10]   In vitro drug interactions of cytochrome P450: An evaluation of fluorogenic to conventional substrates [J].
Cohen, LH ;
Remley, MJ ;
Raunig, D ;
Vaz, ADN .
DRUG METABOLISM AND DISPOSITION, 2003, 31 (08) :1005-1015