A general approach for developing system-specific functions to score protein-ligand docked complexes using support vector inductive logic programming

Cited by: 29
Authors
Amini, Ata
Shrimpton, Paul J.
Muggleton, Stephen H.
Sternberg, Michael J. E. [1]
Affiliations
[1] Univ London Imperial Coll Sci & Technol, Ctr Bioinformat, Div Mol Biosci, Struct Bioinformat Grp, London SW7 2AY, England
[2] Univ London Imperial Coll Sci & Technol, Dept Computat, Computat Bioinformat Lab, London SW7 2AY, England
DOI
10.1002/prot.21782
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Despite the recent increase in the use of protein-ligand and protein-protein docking in the drug discovery process, driven by increases in computational power, the problem of accurately ranking the binding affinities of a series of ligands or a series of proteins docked to a protein receptor remains largely unsolved. This problem is of major concern in lead optimization procedures and has led to the development of scoring functions tailored to rank the binding affinities of a series of ligands to a specific system. However, such methods can take a long time to develop, and their transferability to other systems remains open to question. Here we demonstrate that, given a suitable amount of background information, a new approach using support vector inductive logic programming (SVILP) can be used to produce system-specific scoring functions. Inductive logic programming (ILP) learns logic-based rules for a given dataset that can be used to describe properties of each member of the set in a qualitative manner. By combining ILP with support vector machine regression, a quantitative set of rules can be obtained. SVILP has previously been used in a biological context to examine datasets containing a series of singular molecular structures and properties. Here we describe the use of SVILP to produce binding affinity predictions of a series of ligands to a particular protein. We also examine, for the first time, the applicability of SVILP techniques to datasets consisting of protein-ligand complexes. Our results show that SVILP performs comparably with other state-of-the-art methods on five protein-ligand systems, as judged by similar cross-validated squares of their correlation coefficients. A McNemar test comparing SVILP to CoMFA and CoMSIA across the five systems indicates our method to be significantly better on one occasion.
The ability to graphically display and understand the SVILP-produced rules is demonstrated, and this feature of ILP can be used to derive hypotheses for future ligand design in lead optimization procedures. The approach can readily be extended to evaluate the binding affinities of a series of protein-protein complexes.
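The idea of turning qualitative ILP rules into quantitative input for support vector regression can be illustrated with a minimal sketch. It is not the authors' implementation: the three rules, the descriptor names (`n_hbonds`, `hydrophobic_contacts`, `min_metal_distance`), the thresholds, and the example complexes are all hypothetical, and the kernel shown is a plain count of rules that hold for both complexes, whereas the published SVILP kernel may weight rules differently.

```python
from typing import Callable, Dict, List

# A docked complex is represented here by a few hypothetical descriptors.
Complex = Dict[str, float]
Rule = Callable[[Complex], bool]

# Hypothetical stand-ins for ILP-learned rules; real rules would be
# logic clauses induced from background knowledge about the complexes.
RULES: List[Rule] = [
    lambda c: c["n_hbonds"] >= 2,              # at least two hydrogen bonds
    lambda c: c["hydrophobic_contacts"] >= 5,  # several hydrophobic contacts
    lambda c: c["min_metal_distance"] <= 2.5,  # ligand atom close to a metal ion
]


def rule_features(cplx: Complex, rules: List[Rule] = RULES) -> List[int]:
    """Return a binary vector: 1 where a rule holds for the complex, else 0."""
    return [1 if rule(cplx) else 0 for rule in rules]


def rule_kernel(a: Complex, b: Complex, rules: List[Rule] = RULES) -> int:
    """Count the rules that hold for both complexes (dot product of features)."""
    fa, fb = rule_features(a, rules), rule_features(b, rules)
    return sum(x * y for x, y in zip(fa, fb))


# Illustrative (invented) complexes.
complexes: List[Complex] = [
    {"n_hbonds": 3, "hydrophobic_contacts": 6, "min_metal_distance": 2.1},
    {"n_hbonds": 1, "hydrophobic_contacts": 7, "min_metal_distance": 2.3},
    {"n_hbonds": 0, "hydrophobic_contacts": 2, "min_metal_distance": 4.0},
]

# Gram matrix of pairwise kernel values between all complexes.
gram = [[rule_kernel(a, b) for b in complexes] for a in complexes]

for row in gram:
    print(row)
```

A Gram matrix of this kind could then be handed to any support vector regression routine that accepts a precomputed kernel, with measured binding affinities as the regression targets. Because each feature corresponds to a readable rule, the contribution of individual rules remains open to inspection.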
Pages: 823-831
Number of pages: 9