Calculating the knowledge-based similarity of functional groups using crystallographic data

被引:20
作者
Watson, P
Willett, P
Gillet, VJ
Verdonk, ML
机构
[1] Univ Sheffield, Dept Informat Studies, Krebs Inst Biomol Res, Sheffield S10 2TN, S Yorkshire, England
[2] Cambridge Crystallog Data Ctr, Cambridge CB2 1EZ, England
关键词
bioisoterism; crystallographic databases; knowledge-based similarity; non-bonded interactions; structure-based design; 3D molecular similarity;
D O I
10.1023/A:1013115500749
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A knowledge-based method [or calculating the similarity of functional groups is described and validated. The method is based on experimental information derived from small molecule crystal structures. These data are used in the form of scatterplots that show the likelihood of a non-bonded interaction being formed between functional group A (the 'central group') and functional group B (the 'contact group' or 'probe'). The scatterplots are converted into three-dimensional maps that show the propensity of the probe at different positions around the central group. Here we describe how to calculate the similarity of a pair of central groups based on these maps. The similarity method is validated using bioisosteric functional group pairs identified in the Bioster database and Relibase. The Bioster database is a critical compilation of thousands of bioisosteric molecule pairs, including drugs, enzyme inhibitors and agrochemicals. Relibase is an object-oriented database containing structural data about protein-ligand interactions. The distributions of the similarities of the bioisosteric functional group pairs are compared with similarities for all the possible pairs in IsoStar, and are found to be significantly different. Enrichment factors are also calculated showing the similarity method is statistically significantly better than random in predicting bioisosteric functional group pairs.
引用
收藏
页码:835 / 857
页数:23
相关论文
共 31 条
[1]   THE DEVELOPMENT OF VERSION-3 AND VERSION-4 OF THE CAMBRIDGE STRUCTURAL DATABASE SYSTEM [J].
ALLEN, FH ;
DAVIES, JE ;
GALLOY, JJ ;
JOHNSON, O ;
KENNARD, O ;
MACRAE, CF ;
MITCHELL, EM ;
MITCHELL, GF ;
SMITH, JM ;
WATSON, DG .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1991, 31 (02) :187-204
[2]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[3]  
BOER D, UNPUB
[4]   IsoStar: A library of information about nonbonded interactions [J].
Bruno, IJ ;
Cole, JC ;
Lommerse, JPM ;
Rowland, RS ;
Taylor, R ;
Verdonk, ML .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 1997, 11 (06) :525-537
[5]   HOW SIMILAR IS A MOLECULE TO ANOTHER - AN ELECTRON-DENSITY MEASURE OF SIMILARITY BETWEEN 2 MOLECULAR-STRUCTURES [J].
CARBO, R ;
LEYDA, L ;
ARNAU, M .
INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY, 1980, 17 (06) :1185-1189
[6]  
*CCDC, REL PLUS ALPH VERS
[7]  
DEAN PM, 1998, DESIGNING BIOACTIVE, P199
[8]   SOLVENT INTERACTIONS WITH PI-RING-SYSTEMS IN PROTEINS [J].
FLANAGAN, K ;
WALSHAW, J ;
PRICE, SL ;
GOODFELLOW, JM .
PROTEIN ENGINEERING, 1995, 8 (02) :109-116
[9]  
Friedman H.L., 1951, NASNRS, P295, DOI DOI 10.17226/18474
[10]  
GILBERT AS, 1992, DESIGN DRUGS MACROMO, P25