Reliability of logP predictions based on calculated molecular descriptors:: A critical review

被引:76
作者
Erös, D
Kövesdi, I
Örfi, L
Takács-Novák, K
Acsády, G
Kéri, G
机构
[1] Semmelweis Univ, Dept Pharmaceut Chem, Cooperat Res Ctr, H-1092 Budapest, Hungary
[2] Semmelweis Univ, Dept Med Chem, Peptide Biochem Res Grp, H-1088 Budapest, Hungary
[3] Vichem Chem Ltd, H-1022 Budapest, Hungary
[4] Semmelweis Univ, Dept Cardiovasc Surg, H-1122 Budapest, Hungary
关键词
logP prediction; drugs; neural network; lipophilicity; in-silico-screening; combinatorial libraries;
D O I
10.2174/0929867023369042
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Correct QSAR analysis requires reliable measured or calculated logP values, being logP the most frequently utilized and most important physico-chemical parameter in such studies. Since the publication of theoretical fundamentals of logP prediction, many commercial software solutions are available. These programs are all based on experimental data of huge databases therefore the predicted logP values are mostly acceptable especially for known structures and their derivatives. In this study we critically reviewed the published methods and compared the predictive power of commercial softwares (CLOGP, KOWWIN, SciLogP/ULTRA) to each other and to our recently developed automatic QS(P)AR program. We have selected a very diverse set of 625 known drugs (98%) and drug-like molecules with experimentally validated logP values. We have collected 78 reported "outliers" as well, which could not be predicted by the "traditional" methods. We used these data in the model buildings and validations. Finally, we used an external validation set of compounds missing from public databases. We emphasized the importance of data quality, descriptor calculation and selection, and presented a general, reliable descriptor selection and validation technique for such kind of studies. Our method is based on the strictest mathematical and statistical rules, fully automatic and after the initial settings there is no option for user intervention. Three approaches were applied: multiple linear regression, partial least squares analysis and artificial neural network. LogP predictions with a multiple linear regression model showed acceptable accuracy for new compounds therefore it can be used for "in-silico-screening" and/or planning virtual/combinatorial libraries.
引用
收藏
页码:1819 / 1829
页数:11
相关论文
共 48 条
[1]   On the partition of ampholytes: Application to blood-brain distribution [J].
Abraham, MH ;
TakacsNovak, K ;
Mitchell, RC .
JOURNAL OF PHARMACEUTICAL SCIENCES, 1997, 86 (03) :310-315
[2]   FUNCTIONAL-GROUP CONTRIBUTIONS TO DRUG RECEPTOR INTERACTIONS [J].
ANDREWS, PR ;
CRAIK, DJ ;
MARTIN, JL .
JOURNAL OF MEDICINAL CHEMISTRY, 1984, 27 (12) :1648-1657
[3]  
[Anonymous], [No title captured]
[4]  
[Anonymous], 014201 US GEOL SURV
[5]  
BODOR N, 1994, J MOL STRUC-THEOCHEM, V115, P259, DOI 10.1016/0166-1280(94)80078-2
[6]   Molecular size based approach to estimate partition properties for organic solutes [J].
Bodor, N ;
Buchwald, P .
JOURNAL OF PHYSICAL CHEMISTRY B, 1997, 101 (17) :3404-3412
[7]   A NEW METHOD FOR THE ESTIMATION OF PARTITION-COEFFICIENT [J].
BODOR, N ;
GABANYI, Z ;
WONG, CK .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1989, 111 (11) :3783-3786
[8]   Prediction of the n-octanol/water partition coefficient, logP, using a combination of semiempirical MO-calculations and a neural network [J].
Breindl, A ;
Beck, B ;
Clark, T ;
Glen, RC .
JOURNAL OF MOLECULAR MODELING, 1997, 3 (03) :142-155
[9]  
BROTO P, 1984, EUR J MED CHEM, V19, P71
[10]   COMPUTATION OF MOLECULAR VOLUME [J].
CONNOLLY, ML .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1985, 107 (05) :1118-1124