Parameter estimation for scoring protein-ligand interactions using negative training data

被引:136
作者
Pham, Tuan A.
Jain, Ajay N.
机构
[1] Univ Calif San Francisco, Inst Canc Res, Dept Biopharmaceut Sci, San Francisco, CA 94143 USA
[2] Univ Calif San Francisco, Dept Lab Med, San Francisco, CA 94143 USA
关键词
D O I
10.1021/jm050040j
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Surflex-Dock employs an empirically derived scoring function to rank putative protein-ligand interactions by flexible docking of small molecules to proteins of known structure. The scoring function employed by Surflex was developed purely on the basis of positive data, comprising noncovalent protein-ligand complexes with known binding affinities. Consequently, scoring function terms for improper interactions received little weight in parameter estimation, and an ad hoc scheme for avoiding protein-ligand interpenetration was adopted. We present a generalized method for incorporating synthetically generated negative training data, which allows for rigorous estimation of all scoring function parameters. Geometric docking accuracy remained excellent under the new parametrization. In addition, a test of screening utility covering a diverse set of 29 proteins and corresponding ligand sets showed improved performance. Maximal enrichment of true ligands over nonligands exceeded 20-fold in over 80% of cases, with enrichment of greater than 100-fold in over 50% of cases.
引用
收藏
页码:5856 / 5868
页数:13
相关论文
共 34 条
  • [1] Protein-based virtual screening of chemical databases. 1. Evaluation of different docking/scoring combinations
    Bissantz, C
    Folkers, G
    Rognan, D
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2000, 43 (25) : 4759 - 4767
  • [3] Informative library design as an efficient strategy to identify and optimize leads: Application to cyclin-dependent kinase 2 antagonists
    Bradley, EK
    Miller, JL
    Saiah, E
    Grootenhuis, PDJ
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2003, 46 (20) : 4360 - 4364
  • [4] Molecular docking and high-throughput screening for novel inhibitors of protein tyrosine phosphatase-1B
    Doman, TN
    McGovern, SL
    Witherbee, BJ
    Kasten, TP
    Kurumbail, R
    Stallings, WC
    Connolly, DT
    Shoichet, BK
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2002, 45 (11) : 2213 - 2221
  • [5] Empirical scoring functions .1. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes
    Eldridge, MD
    Murray, CW
    Auton, TR
    Paolini, GV
    Mee, RP
    [J]. JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 1997, 11 (05) : 425 - 445
  • [6] Glide: A new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy
    Friesner, RA
    Banks, JL
    Murphy, RB
    Halgren, TA
    Klicic, JJ
    Mainz, DT
    Repasky, MP
    Knoll, EH
    Shelley, M
    Perry, JK
    Shaw, DE
    Francis, P
    Shenkin, PS
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2004, 47 (07) : 1739 - 1749
  • [7] Knowledge-based scoring function to predict protein-ligand interactions
    Gohlke, H
    Hendlich, M
    Klebe, G
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2000, 295 (02) : 337 - 356
  • [8] Goodsell DS, 1996, J MOL RECOGNIT, V9, P1, DOI 10.1002/(SICI)1099-1352(199601)9:1<1::AID-JMR241>3.0.CO
  • [9] 2-6
  • [10] Glide: A new approach for rapid, accurate docking and scoring. 2. Enrichment factors in database screening
    Halgren, TA
    Murphy, RB
    Friesner, RA
    Beard, HS
    Frye, LL
    Pollard, WT
    Banks, JL
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2004, 47 (07) : 1750 - 1759