A multivariate approach to investigate docking parameters' effects on docking performance

被引:22
作者
Andersson, C. David
Thysell, Elin
Lindstrom, Anton
Bylesjo, Max
Raubacher, Florian
Linusson, Anna [1 ]
机构
[1] Umea Univ, Dept Chem, S-90187 Umea, Sweden
[2] AstraZeneca R&D, S-43183 Molndal, Sweden
关键词
D O I
10.1021/ci6005596
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Increasingly powerful docking programs for analyzing and estimating the strength of protein-ligand interactions have been developed in recent decades, and they are now valuable tools in drug discovery. Software used to perform dockings relies on a number of parameters that affect various steps in the docking procedure. However, identifying the best choices of the settings for these parameters is often challenging. Therefore, the settings of the parameters are quite often left at their default values, even though scientists with long experience with a specific docking tool know that modifying certain parameters can improve the results. In the study presented here, we have used statistical experimental design and subsequent regression based on root-mean-square deviation values using partial least-square projections to latent structures (PLS) to scrutinize the effects of different parameters on the docking performance of two software packages: FRED and GOLD. Protein-ligand complexes with a high level of ligand diversity were selected from the PDBbind database for the study, using principal component analysis based on 1D and 2D descriptors, and space-filling design. The PLS models showed quantitative relationships between the docking parameters and the ability of the programs to reproduce the ligand crystallographic conformation. The PLS models also revealed which of the parameters and what parameter settings were important for the docking performance of the two programs. Furthermore, the variation in docking results obtained with specific parameter settings for different protein-ligand complexes in the diverse set examined indicates that there is great potential for optimizing the parameter settings for selected sets of proteins.
引用
收藏
页码:1673 / 1687
页数:15
相关论文
共 68 条
[11]   D-optimal designs [J].
deAguiar, PF ;
Bourguignon, B ;
Khots, MS ;
Massart, DL ;
PhanThanLuu, R .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1995, 30 (02) :199-210
[12]   A SIMPLE BAYESIAN MODIFICATION OF D-OPTIMAL DESIGNS TO REDUCE DEPENDENCE ON AN ASSUMED MODEL [J].
DUMOUCHEL, W ;
JONES, B .
TECHNOMETRICS, 1994, 36 (01) :37-47
[13]   A LEISURELY LOOK AT THE BOOTSTRAP, THE JACKKNIFE, AND CROSS-VALIDATION [J].
EFRON, B ;
GONG, G .
AMERICAN STATISTICIAN, 1983, 37 (01) :36-48
[14]  
Eriksson L, 1996, J CHEMOMETR, V10, P483
[15]  
Eriksson L., 2006, MULTI MEGAVARIATE DA, DOI [10.1002/cem.713, DOI 10.1002/CEM.713]
[16]   Robustness testing of a tablet formulation using multivariate design [J].
Gabrielsson, J ;
Sjöström, M ;
Lindberg, NO ;
Pihl, AC ;
Lundstedt, T .
DRUG DEVELOPMENT AND INDUSTRIAL PHARMACY, 2006, 32 (03) :297-307
[17]   Multivariate methods in pharmaceutical applications [J].
Gabrielsson, J ;
Lindberg, NO ;
Lundstedt, T .
JOURNAL OF CHEMOMETRICS, 2002, 16 (03) :141-160
[18]   Knowledge-based scoring function to predict protein-ligand interactions [J].
Gohlke, H ;
Hendlich, M ;
Klebe, G .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 295 (02) :337-356
[19]   Principles of docking: An overview of search algorithms and a guide to scoring functions [J].
Halperin, I ;
Ma, BY ;
Wolfson, H ;
Nussinov, R .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 47 (04) :409-443
[20]   MOLECULAR RECOGNITION OF RECEPTOR-SITES USING A GENETIC ALGORITHM WITH A DESCRIPTION OF DESOLVATION [J].
JONES, G ;
WILLETT, P ;
GLEN, RC .
JOURNAL OF MOLECULAR BIOLOGY, 1995, 245 (01) :43-53