A comparative study on the application of hierarchical-agglomerative clustering approaches to organize outputs of reiterated docking runs

被引:51
作者
Bottegoni, G [1 ]
Cavalli, A [1 ]
Recanatini, M [1 ]
机构
[1] Univ Bologna, Dept Pharmaceut Sci, I-40126 Bologna, Italy
关键词
D O I
10.1021/ci050141q
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Reiterated runs of standard docking protocols usually provide a collection of possible binding modes rather than pinpoint a single solution. Usually, this ensemble is then ranked by means of an energy-based scoring function. However, since many degrees of approximation have to be introduced in the computation of the binding free energy, scoring functions cannot always rank the experimental pose. among the top scorers. Cluster analysis might help to overcome this limit, provided that data clusterability has been earlier assessed. In this paper, first, we present a modified version of a test earlier developed by Hopkins to assess whether or not docking outputs show the natural tendency to be grouped in clusters. Then, we report the results of a comparative study on the application of different hierarchical-agglomerative cluster rules to partition docking outputs. The rule that was able to best manage the observed data was finally applied to the whole ensemble of poses collected from several docking tools. The combination of the average linkage rule with the cutting function developed by Sutcliffe and co-workers turned out to be an approach that meets all of the criteria required for a robust clustering protocol. Furthermore, a consensus clustering allowed us to identify the pose closest to the experimental one within a statistically significant cluster, whose number was always of few units.
引用
收藏
页码:852 / 862
页数:11
相关论文
共 33 条
[11]   Development and validation of a genetic algorithm for flexible docking [J].
Jones, G ;
Willett, P ;
Glen, RC ;
Leach, AR ;
Taylor, R .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 267 (03) :727-748
[12]   The many roles of computation in drug discovery [J].
Jorgensen, WL .
SCIENCE, 2004, 303 (5665) :1813-1818
[13]   Assessment of multiple binding modes in ligand-protein docking [J].
Källblad, P ;
Mancera, RL ;
Todorov, NP .
JOURNAL OF MEDICINAL CHEMISTRY, 2004, 47 (13) :3334-3337
[14]   Comparative evaluation of eight docking tools for docking and virtual screening accuracy [J].
Kellenberger, E ;
Rodrigo, J ;
Muller, P ;
Rognan, D .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 57 (02) :225-242
[15]   An automated approach for defining core atoms and domains in an ensemble of NMR-derived protein structures [J].
Kelley, LA ;
Gardner, SP ;
Sutcliffe, MJ .
PROTEIN ENGINEERING, 1997, 10 (06) :737-741
[16]   Docking and scoring in virtual screening for drug discovery: Methods and applications [J].
Kitchen, DB ;
Decornez, H ;
Furr, JR ;
Bajorath, J .
NATURE REVIEWS DRUG DISCOVERY, 2004, 3 (11) :935-949
[17]   Evaluation of docking performance: Comparative data on docking algorithms [J].
Kontoyianni, M ;
McClellan, LM ;
Sokol, GS .
JOURNAL OF MEDICINAL CHEMISTRY, 2004, 47 (03) :558-565
[18]   AN EXAMINATION OF PROCEDURES FOR DETERMINING THE NUMBER OF CLUSTERS IN A DATA SET [J].
MILLIGAN, GW ;
COOPER, MC .
PSYCHOMETRIKA, 1985, 50 (02) :159-179
[19]   MACROMODEL - AN INTEGRATED SOFTWARE SYSTEM FOR MODELING ORGANIC AND BIOORGANIC MOLECULES USING MOLECULAR MECHANICS [J].
MOHAMADI, F ;
RICHARDS, NGJ ;
GUIDA, WC ;
LISKAMP, R ;
LIPTON, M ;
CAUFIELD, C ;
CHANG, G ;
HENDRICKSON, T ;
STILL, WC .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1990, 11 (04) :440-467
[20]  
MOJENA R, 1977, COMPUT J, V20, P353