Customizing scoring functions for docking

Cited by: 49
Authors
Pham, Tuan A. [1]
Jain, Ajay N. [1]
Affiliations
[1] Univ Calif San Francisco, San Francisco, CA 94143 USA
DOI
10.1007/s10822-008-9174-y
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Empirical scoring functions used in protein-ligand docking calculations are typically trained on a dataset of complexes with known affinities with the aim of generalizing across different docking applications. We report a novel method of scoring-function optimization that supports the use of additional information to constrain scoring function parameters, which can be used to focus a scoring function's training towards a particular application, such as screening enrichment. The approach combines multiple instance learning, positive data in the form of ligands of protein binding sites of known and unknown affinity and binding geometry, and negative (decoy) data of ligands thought not to bind particular protein binding sites or known not to bind in particular geometries. Performance of the method for the Surflex-Dock scoring function is shown in cross-validation studies and in eight blind test cases. Tuned functions optimized with a sufficient amount of data exhibited either improved or undiminished screening performance relative to the original function across all eight complexes. Analysis of the changes to the scoring function suggests that modifications can be learned that are related to protein-specific features such as active-site mobility.
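The general idea in the abstract (multiple instance learning over candidate poses, with decoys acting as constraints on the parameters) can be illustrated with a toy sketch. This is not the Surflex-Dock scoring function or the authors' optimization procedure: the linear scoring form, the two pose features, the decoy score cap, the squared-error loss, and all function names below are assumptions made purely for illustration.

```python
# Toy illustration only: a linear "scoring function" tuned with positive and decoy data.
# Every name, feature, and number here is hypothetical and not taken from the paper.

def score_pose(weights, features):
    """Linear score of a single pose (assumed functional form)."""
    return sum(w * f for w, f in zip(weights, features))

def bag_score(weights, poses):
    """Multiple-instance score of a ligand: the best score over its candidate poses."""
    return max(score_pose(weights, p) for p in poses)

def loss(weights, positives, decoys, decoy_weight=1.0):
    """Squared error on ligands with known affinity, plus a penalty
    whenever a decoy's best pose scores above its allowed cap."""
    total = 0.0
    for poses, affinity in positives:
        total += (bag_score(weights, poses) - affinity) ** 2
    for poses, cap in decoys:
        excess = max(0.0, bag_score(weights, poses) - cap)
        total += decoy_weight * excess ** 2
    return total

def tune(weights, positives, decoys, step=0.01, iters=500, eps=1e-5):
    """Plain gradient descent using forward finite-difference gradients."""
    w = list(weights)
    for _ in range(iters):
        base = loss(w, positives, decoys)
        grad = []
        for i in range(len(w)):
            bumped = w[:]
            bumped[i] += eps
            grad.append((loss(bumped, positives, decoys) - base) / eps)
        w = [wi - step * gi for wi, gi in zip(w, grad)]
    return w

# Each ligand is a "bag" of poses; each pose is a two-element feature vector.
positives = [
    ([[1.0, 0.5], [0.8, 0.2]], 7.0),  # (candidate poses, known affinity)
    ([[0.6, 1.0], [0.3, 0.4]], 6.0),
]
decoys = [
    ([[0.9, 0.1], [0.4, 0.3]], 4.0),  # (candidate poses, score cap for a presumed non-binder)
]

initial = [5.0, 5.0]
tuned = tune(initial, positives, decoys)
print("loss before:", loss(initial, positives, decoys))
print("loss after: ", loss(tuned, positives, decoys))
```

In this sketch the decoy term only contributes when a presumed non-binder scores above its cap, so it constrains the weights without requiring an affinity value for the decoy.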
Pages: 269-286
Number of pages: 18