A general approach for developing system-specific functions to score protein-ligand docked complexes using support vector inductive logic programming

被引:29
作者
Amini, Ata
Shrimpton, Paul J.
Muggleton, Stephen H.
Sternberg, Michael J. E. [1 ]
机构
[1] Univ London Imperial Coll Sci & Technol, Ctr Bioinformat, Div Mol Biosci, Struct Bioinformat Grp, London SW7 2AY, England
[2] Univ London Imperial Coll Sci & Technol, Dept Computat, Computat Bioinformat Lab, London SW7 2AY, England
关键词
D O I
10.1002/prot.21782
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Despite the increased recent use of protein-ligand and protein-protein docking in the drug discovery process due to the increases in computational power, the difficulty of accurately ranking the binding affinities of a series of ligands or a series of proteins docked to a protein receptor remains largely unsolved. This problem is of major concern in lead optimization procedures and has lead to the development of scoring functions tailored to rank the binding affinities of a series of ligands to a specific system. However, such methods can take a long time to develop and their transferability to other systems remains open to question. Here we demonstrate that given a suitable amount of background information a new approach using support vector inductive logic programming (SVILP) can be used to produce system-specific scoring functions. Inductive logic programming (ILP) learns logic-based rules for a given dataset that can be used to describe properties of each member of the set in a qualitative manner. By combining ILP with support vector machine regression, a quantitative set Of rules can be obtained. SVILP has previously been used in a biological context to examine datasets containing a series of singular molecular structures and properties. Here we describe the use of SVILP to produce binding affinity predictions of a series of ligands to a particular protein. We also for the first time examine the applicability of SVILP techniques to datasets consisting of protein-ligand complexes. Our results show that SVILP performs comparably with other state-of-the-art methods on five protein-ligand systems as judged by similar cross-validated squares of their correlation coefficients. A McNemar test comparing SVILP to CoMEA and CoMSIA across the five systems indicates our method to be significantly better on one occasion. The ability to graphically display and understand the SVILP-produced rules is demonstrated and this feature of ILP can be used to derive hypothesis for future ligand design in lead optimization procedures. The approach can readily be extended to evaluate the binding affinities of a series of protein-protein complexes.
引用
收藏
页码:823 / 831
页数:9
相关论文
共 57 条
[1]   A novel logic-based approach for quantitative toxicology prediction [J].
Amini, Ata ;
Muggleton, Stephen H. ;
Lodhi, Huma ;
Sternberg, Michael J. E. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (03) :998-1006
[2]   The use of consensus scoring in ligand-based virtual screening [J].
Baber, JC ;
William, AS ;
Gao, YH ;
Feher, M .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (01) :277-288
[3]   Unexpected binding mode of a cyclic sulfamide HIV-1 protease inhibitor [J].
Backbro, K ;
Lowgren, S ;
Osterlund, K ;
Atepo, J ;
Unge, T ;
Hulten, J ;
Bonham, NM ;
Schaal, W ;
Karlen, A ;
Hallberg, A .
JOURNAL OF MEDICINAL CHEMISTRY, 1997, 40 (06) :898-902
[4]   Similarity searching of chemical databases using atom environment descriptors (MOLPRINT 2D): Evaluation of performance [J].
Bender, A ;
Mussa, HY ;
Glen, RC ;
Reiling, S .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (05) :1708-1718
[5]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[6]   Protein-based virtual screening of chemical databases. 1. Evaluation of different docking/scoring combinations [J].
Bissantz, C ;
Folkers, G ;
Rognan, D .
JOURNAL OF MEDICINAL CHEMISTRY, 2000, 43 (25) :4759-4767
[7]   Three-dimensional quantitative structure-activity relationship analyses using comparative molecular field analysis and comparative molecular similarity indices analysis to elucidate selectivity differences of inhibitors binding to trypsin, thrombin, and factor Xa [J].
Böhm, M ;
Stürzebecher, J ;
Klebe, G .
JOURNAL OF MEDICINAL CHEMISTRY, 1999, 42 (03) :458-477
[8]   Structural analysis of inhibitor binding to human carbonic anhydrase II [J].
Boriack-Sjodin, PA ;
Zeitlin, S ;
Chen, HH ;
Crenshaw, L ;
Gross, S ;
Dantanarayana, A ;
Delgado, P ;
May, JA ;
Dean, T ;
Christianson, DW .
PROTEIN SCIENCE, 1998, 7 (12) :2483-2489
[9]   Support vector inductive logic programming outperforms the naive Bayes classifier and inductive logic programming for the classification of bioactive chemical compounds [J].
Cannon, Edward O. ;
Amini, Ata ;
Bender, Andreas ;
Sternberg, Michael J. E. ;
Muggleton, Stephen H. ;
Glen, Robert C. ;
Mitchell, John B. O. .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2007, 21 (05) :269-280
[10]   Consensus scoring: A method for obtaining improved hit rates from docking databases of three-dimensional structures into proteins [J].
Charifson, PS ;
Corkery, JJ ;
Murcko, MA ;
Walters, WP .
JOURNAL OF MEDICINAL CHEMISTRY, 1999, 42 (25) :5100-5109