Mini-fingerprints for virtual screening: Design principles and generation of novel prototypes based on information theory

被引:24
作者
Xue, L
Godden, JW
Bajorath, J
机构
[1] Albany Mol Res Inc, Dept Comp Aided Drug Discovery, Bothell Res Ctr, Bothell, WA 98011 USA
[2] Univ Washington, Dept Biol Struct, Seattle, WA 98195 USA
关键词
biological activity; fingerprint design; information theory; molecular descriptors; molecular similarity; virtual screening;
D O I
10.1080/1062936021000058764
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Binary fingerprint representations of molecular structure and properties are convenient computational tools for similarity searching in compound databases and virtual screening (VS). We are investigating the design of relatively simple fingerprints for the identification of molecules having similar biological activity and recognition of remote similarity relationships. Since our designs are considerably shorter than other fingerprints used in VS, we have previously termed them "mini-fingerprints" (MFPs). A key aspect of the design strategy is the identification of suitable molecular descriptors. Whereas our initial fingerprint designs have relied on descriptor combinations that performed well in compound classification according to biological activity, second generation MFPs encode combinations of descriptors with high information content in large compound databases and high frequency of occurrence in drug-like molecules. Thus, the design of these new fingerprints does not depend on the analysis of specific classes of bioactive compounds, but rather on descriptor information content in large compound databases. Systematic evaluation of fingerprint performance in VS test calculations demonstrates that these new prototypes perform better than previously generated MFPs. The analysis described herein provides an example for the development of search tools for VS.
引用
收藏
页码:27 / 40
页数:14
相关论文
共 28 条
[1]   Rational drug discovery revisited: interfacing experimental programs with bio- and chemo-informatics [J].
Bajorath, J .
DRUG DISCOVERY TODAY, 2001, 6 (19) :989-995
[2]   Selected concepts and investigations in compound classification, molecular descriptor analysis, and virtual screening [J].
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (02) :233-245
[3]   A rapid computational method for lead evolution:: Description and application to α1-adrenergic antagonists [J].
Bradley, EK ;
Beroza, P ;
Penzotti, JE ;
Grootenhuis, PDJ ;
Spellmeyer, DC ;
Miller, JL .
JOURNAL OF MEDICINAL CHEMISTRY, 2000, 43 (14) :2770-2774
[4]  
BROWN RD, 1997, J CHEM INF COMP SCI, V37, P731
[5]  
FOX S, 2002, DURG DISCOV WORL SUM, P24
[6]  
Godden J W, 2000, Pac Symp Biocomput, P566
[7]   Variability of molecular descriptors in compound databases revealed by Shannon entropy calculations [J].
Godden, JW ;
Stahura, FL ;
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (03) :796-800
[8]   Chemical descriptors with distinct levels of information content and varying sensitivity to differences between selected compound databases identified by SE-DSE analysis [J].
Godden, JW ;
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (01) :87-93
[9]   Differential shannon entropy as a sensitive measure of differences in database variability of molecular descriptors [J].
Godden, JW ;
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (04) :1060-1066
[10]  
James CA, 1995, Daylight theory manual. daylight chemical information systems