ADME evaluation in drug discovery. 4. Prediction of aqueous solubility based on atom contribution approach

被引:203
作者
Hou, TJ [1 ]
Xia, K [1 ]
Zhang, W [1 ]
Xu, XJ [1 ]
机构
[1] Peking Univ, Coll Chem & Mol Engn, Beijing 100871, Peoples R China
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2004年 / 44卷 / 01期
关键词
D O I
10.1021/ci034184n
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
A novel method for the estimation of aqueous solubility was solely based on simple atom contribution. Each atom in a molecule has its own contribution to aqueous solubility and was developed. Altogether 76 atom types were used to classify atoms with different chemical environments. Moreover, two correction factors, including hydrophobic carbon and square of molecular weight, were used to account for the inter-/ intramolecular hydrophobic interactions and bulkiness effect. The contribution coefficients of different atom types and correction factors were generated based on a multiple linear regression using a learning set consisting of 1290 organic compounds. The obtained linear regression model possesses good statistical significance with an overall correlation coefficient (r) of 0.96, a standard deviation (s) of 0.61, and an unsigned mean error (UME) of 0.48. The actual prediction potential of the model was validated through an external test set with 21 pharmaceutically and environmentally interesting compounds. For the test set, a predictive r = 0.94, s = 0.84, and UME = 0.52 were achieved. Comparisons among eight procedures of solubility calculation for those 21 molecules demonstrate that our model bears very good accuracy and is comparable to or even better than most reported techniques based on molecular descriptors. Moreover, we compared the performance of our model to a test set of 120 molecules with a popular group contribution method developed by Klopman et al. For this test set, our model gives a very effective prediction (r = 0.96, s = 0.79, UME = 0.57), which is obviously superior to the predicted results (r = 0.96, s = 0.84, UME = 0.70) given by the Klopman's group contribution approach. Because of the adoption of atoms as the basic units, our addition model does not contain a "missing fragment" problem and thus may be more simple and universal than the group contribution models and can give predictions for any organic molecules. A program, drug-LOGS, had been developed to identify the occurrence of atom types and estimate the aqueous solubility of a molecule.
引用
收藏
页码:266 / 275
页数:10
相关论文
共 34 条
[1]  
*ACC INC, WEBLABVIEWER
[2]  
Butina D, 2003, J CHEM INF COMP SCI, V43, P837, DOI [10.1021/ci020279y, 10.1021/6020279y]
[3]  
ECELROY NR, 2001, J CHEM INF COMP SCI, V41, P1237
[4]   High-throughput, in silico prediction of aqueous solubility based on one- and two-dimensional descriptors [J].
Engkvist, O ;
Wrede, P .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (05) :1247-1249
[6]  
Halgren TA, 1996, J COMPUT CHEM, V17, P490, DOI [10.1002/(SICI)1096-987X(199604)17:5/6<490::AID-JCC1>3.0.CO
[7]  
2-P, 10.1002/(SICI)1096-987X(199604)17:5/6<616::AID-JCC5>3.0.CO
[8]  
2-X]
[9]  
Hou TJ, 2003, J CHEM INF COMP SCI, V43, P1058, DOI 10.1021/6034007rn
[10]   ADME evaluation in drug discovery - 1. Applications of genetic algorithms to the prediction of blood-brain partitioning of a large set of drugs [J].
Hou, TJ ;
Xu, XJ .
JOURNAL OF MOLECULAR MODELING, 2002, 8 (12) :337-349