Prediction of aqueous solubility of organic compounds from molecular structure

被引:125
作者
Mitchell, BE [1 ]
Jurs, PC [1 ]
机构
[1] Penn State Univ, Dept Chem, Davey Lab 152, University Pk, PA 16802 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 1998年 / 38卷 / 03期
关键词
D O I
10.1021/ci970117f
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Multiple linear regression (MLR) and computational neural networks (CNN) are utilized to develop mathematical models to relate the structures of a diverse set of 332 organic compounds to their aqueous solubilities. Topological, geometric, and electronic descriptors are used to numerically represent structural features of the data set compounds. Genetic algorithm and simulated annealing routines, in conjunction with MLR and CNN, are used to select subsets of descriptors that accurately relate to aqueous solubility. Nonlinear models with nine calculated structural descriptors are developed that have a training set root-mean-square error of 0.394 log units for compounds which span a -log(molarity) range from -2 to +12 log units.
引用
收藏
页码:489 / 496
页数:8
相关论文
共 49 条
[1]   NEURAL NETWORK STUDIES .1. ESTIMATION OF THE AQUEOUS SOLUBILITY OF ORGANIC-COMPOUNDS [J].
BODOR, N ;
HARGET, A ;
HUANG, MJ .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1991, 113 (25) :9480-9483
[2]   A NEW METHOD FOR THE ESTIMATION OF THE AQUEOUS SOLUBILITY OF ORGANIC-COMPOUNDS [J].
BODOR, N ;
HUANG, MJ .
JOURNAL OF PHARMACEUTICAL SCIENCES, 1992, 81 (09) :954-960
[3]  
BODOR N, 1992, INT J QUANTUM CHEM, V26, P853
[4]  
Broyden C.G., 1970, J I MATH ITS APPL, V6, P76, DOI DOI 10.1093/IMAMAT/6.1.76
[5]  
Cao CZ, 1996, ACTA CHIM SINICA, V54, P533
[6]   ATOMIC CHARGE CALCULATIONS FOR QUANTITATIVE STRUCTURE PROPERTY RELATIONSHIPS [J].
DIXON, SL ;
JURS, PC .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1992, 13 (04) :492-504
[7]  
DIXON SL, 1994, THESIS PENNSYLVANIA
[8]   PREDICTION OF BOILING POINTS AND CRITICAL-TEMPERATURES OF INDUSTRIALLY IMPORTANT ORGANIC-COMPOUNDS FROM MOLECULAR-STRUCTURE [J].
EGOLF, LM ;
WESSEL, MD ;
JURS, PC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (04) :947-956
[9]   Prediction of supercritical carbon dioxide solubility of organic compounds from molecular structure [J].
Engelhardt, HL ;
Jurs, PC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (03) :478-484
[10]   A NEW APPROACH TO VARIABLE METRIC ALGORITHMS [J].
FLETCHER, R .
COMPUTER JOURNAL, 1970, 13 (03) :317-&