Efficient generation, storage, and manipulation of fully flexible pharmacophore multiplets and their use in 3-D similarity searching

被引:46
作者
Abrahamian, E
Fox, PC
Nærum, L
Christensen, IT
Thogersen, H
Clark, RD
机构
[1] Tripos Inc, St Louis, MO 63144 USA
[2] Novo Nordisk AS, Med Chem Res, DK-2760 Malov, Denmark
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2003年 / 43卷 / 02期
关键词
D O I
10.1021/ci025595r
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Pharmacophore triplets and quartets have been used by many groups in recent years, primarily as a tool for molecular diversity analysis. In most cases, slow processing speeds and the very large size of the bitsets generated have forced researchers to compromise in terms of how such multiplets were stored, manipulated, and compared, e.g., by using simple unions to represent multiplets for sets of molecules. Here we report using bitmaps in place of bitsets to reduce storage demands and to improve processing speed. Here, a bitset is taken to mean a fully enumerated string of zeros and ones, from which a compressed bitmap is obtained by replacing uniform blocks ("runs") of digits in the bitset with a pair of values identifying the content and length of the block (run-length encoding compression). High-resolution multiplets involving four features are enabled by using 64 bit executables to create and manipulate bitmaps, which "connect" to the 32 bit executables used for database access and feature identification via an extensible mark-up language (XML) data stream. The encoding system used supports simple pairs, triplets, and quartets; multiplets in which a privileged substructure is used as an anchor point; and augmented multiplets in which an additional vertex is added to represent a contingent feature such as a hydrogen bond extension point linked to a complementary feature (e.g., a donor or an acceptor atom) in a base pair or triplet. It can readily be extended to larger, more complex multiplets as well. Database searching is one particular potential application for this technology. Consensus bitmaps built up from active ligands identified in preliminary screening can be used to generate hypothesis bitmaps, a process which includes allowance for differential weighting to allow greater emphasis to be placed on bits arising from multiplets expected to be particularly discriminating. Such hypothesis bitmaps are shown to be useful queries for database searching, successfully retrieving active compounds across a range of structural classes from a corporate database. The current implementation allows multiconformer bitmaps to be obtained from pregenerated conformations or by random perturbation on-the-fly. The latter application involves random sampling of the full range of conformations not precluded by steric clashes, which limits the usefulness of classical fingerprint similarity measures. A new measure of similarity, The Stochastic Cosine, is introduced here to address this need. This new similarity measure uses the average number of bits common to independently drawn conformer sets to normalize the cosine coefficient. Its use frees the user from having to ensure strict comparability of starting conformations and having to use fixed torsional increments, thereby allowing fully flexible characterization of pharmacophoric patterns.
引用
收藏
页码:458 / 468
页数:11
相关论文
共 29 条
  • [1] SYBYL line notation (SLN): A versatile language for chemical structure representation
    Ash, S
    Cline, MA
    Homer, RW
    Hurst, T
    Smith, GB
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (01): : 71 - 79
  • [2] Identification of common functional configurations among molecules
    Barnum, D
    Greene, J
    Smellie, A
    Sprague, P
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (03): : 563 - 571
  • [3] Use of structure Activity data to compare structure-based clustering methods and descriptors for use in compound selection
    Brown, RD
    Martin, YC
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (03): : 572 - 584
  • [4] Synthesis and pharmacological evaluation of novel cis-3,4-diaryl-hydroxychromanes as high affinity partial agonists for the estrogen receptor
    Bury, PS
    Christiansen, LB
    Jacobsen, P
    Jorgensen, AS
    Kanstrup, A
    Nærum, L
    Bain, S
    Fledelius, C
    Gissel, B
    Hansen, BS
    Korsgaard, N
    Thorpe, SM
    Wassermann, K
    [J]. BIOORGANIC & MEDICINAL CHEMISTRY, 2002, 10 (01) : 125 - 145
  • [5] Cato SJ, 2000, IUL BIOTECH, V2, P108
  • [6] Recursive partitioning analysis of a large structure-activity data set using three-dimensional descriptors
    Chen, X
    Rusinko, A
    Young, SS
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (06): : 1054 - 1062
  • [7] Synthesis and biological evaluation of novel thio-substituted chromanes as high-affinity partial agonists for the estrogen receptor
    Christiansen, LB
    Wenckens, M
    Bury, PS
    Gissel, B
    Hansen, BS
    Thorpe, SM
    Jacobsen, P
    Kanstrup, A
    Jorgensen, AS
    Nærum, L
    Wassermann, K
    [J]. BIOORGANIC & MEDICINAL CHEMISTRY LETTERS, 2002, 12 (01) : 17 - 19
  • [8] Coupling structure-based design with combinatorial chemistry: application of active site derived pharmacophores with informative library design
    Eksterowicz, JE
    Evensen, E
    Lemmen, C
    Brady, GP
    Lanctot, JK
    Bradley, EK
    Saiah, E
    Robinson, LA
    Grootenhuis, PDJ
    Blaney, JM
    [J]. JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2002, 20 (06) : 469 - 477
  • [9] Ferguson A. M., 1996, J BIOMOL SCREEN, V1, P65
  • [10] New methodology for profiling combinatorial libraries and screening sets: Cleaning up the design process with HARPick
    Good, AC
    Lewis, RA
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 1997, 40 (24) : 3926 - 3936