Computational chemistry development of a unified free energy Markov model for the distribution of 1300 chemicals to 38 different environmental or biological systems

Cited by: 68
Authors
Cruz-Monteagudo, Maykel
Gonzalez-Diaz, Humberto [1]
Agüero-Chapin, Guillermin
Santana, Lourdes
Borges, Fernanda
Rosa Dominguez, Elena
Podda, Gianni
Uriarte, Eugenio
Affiliations
[1] Univ Santiago de Compostela, Dept Organ Chem, Fac Pharm, Santiago De Compostela 15782, Spain
[2] Univ Porto, Physicochem Mol Res Unit, Dept Organ Chem, Fac Pharm, P-4050047 Oporto, Portugal
[3] Univ Cagliari, Dipartimento Farmaco Chim Tecnol, I-09124 Cagliari, Italy
Keywords
chem-informatics; quantitative structure-property relationships; Markov models; free energy; partition coefficients; environmental distribution of chemicals
DOI
10.1002/jcc.20730
Chinese Library Classification
O6 [Chemistry]
Discipline classification code
0703
Abstract
Predicting the tissue and environmental distribution of chemicals is of major importance for the environmental and life sciences. Most molecular descriptors used in the computational prediction of partition behavior consider molecular structure but ignore the nature of the partition system. Consequently, the computational models derived to date are restricted to the specific system under study. Here, a free energy-based descriptor, ΔG(k), is introduced that circumvents this problem. Based on ΔG(k), we developed for the first time a single linear classification model to predict the partition behavior of a large set of structurally diverse drugs and other chemicals (1300) in 38 different partition systems of biological and environmental significance. The model achieved accuracies of 91.79% on the training set and 88.92% on the prediction set. Parametric assumptions were checked. Desirability analysis was used to explore the levels of the predictors that produce the most desirable partition properties. Finally, inverting the partition direction for each of the 38 partition systems showed that the model correctly classified 89.08% of compounds, with an uncertainty of only ±0.17%, independently of the direction of the partition process used to derive the model. Ten other classification models (linear, neural networks, and genetic algorithms) were also tested for the same purpose. None of them compared favorably with the linear model, indicating that our approach captures the main aspects that govern the partition of chemicals in different systems. © 2007 Wiley Periodicals, Inc.
Pages: 1909-1923
Page count: 15