Similarity search profiling reveals effects of fingerprint scaling in virtual screening

被引:33
作者
Xue, L
Stahura, FL
Bajorath, E
机构
[1] BRC, AMRI, Dept Comp Aided Drug Discovery, Bothell, WA 98011 USA
[2] Univ Washington, Dept Biol Struct, Seattle, WA 98195 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2004年 / 44卷 / 06期
关键词
D O I
10.1021/ci0400819
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Fingerprint scaling is a method to increase the performance of similarity search calculations. It is based on the detection of bit patterns in keyed fingerprints that are si-natures of specific compound classes. Application of scaling factors to consensus hits that are mostly set on emphasizes signature bit patterns during similarity searching and has been shown to improve search results for different fingerprints. Similarity search profiling has recently been introduced as a method to analyze similarity search calculations. Profiles separately monitor correctly identified hits and other detected database compounds as a function of similarity threshold values and make it possible to estimate whether virtual screening calculations can be Successful or to evaluate why they fail. This similarity search profile technique has been applied here to Study fingerprint sealing in detail and better understand effects that are responsible for its performance. In particular, we have focused on the qualitative and quantitative analysis of similarity search profiles under scaling conditions. Therefore, we have carried out systematic similarity search calculations for 23 biological activity classes under scaling conditions over a wide range of scaling factors in a compound database containing similar to1.3 million molecules and monitored these calculations in similarity search profiles. Analysis of these profiles confirmed increases in hit rates as a consequence of scaling and revealed that scaling influences similarity search calculations in different ways. Based on scaled similarity search profiles. compound sets could be divided into different categories. In a number of cases, increases in search performance Under scaling conditions were due to a more significant relative increase in correctly identified hits than detected false-positives. This was also consistent with the finding that preferred similarity threshold values increased due to fingerprint scaling, which was well illustrated by similarity search profiling.
引用
收藏
页码:2032 / 2039
页数:8
相关论文
共 22 条
[1]   Integration of virtual and high-throughput screening [J].
Bajorath, F .
NATURE REVIEWS DRUG DISCOVERY, 2002, 1 (11) :882-894
[2]   Selected concepts and investigations in compound classification, molecular descriptor analysis, and virtual screening [J].
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (02) :233-245
[3]   SUBSTRUCTURAL ANALYSIS - NOVEL APPROACH TO PROBLEM OF DRUG DESIGN [J].
CRAMER, RD ;
REDL, G ;
BERKOFF, CE .
JOURNAL OF MEDICINAL CHEMISTRY, 1974, 17 (05) :533-535
[4]   Effectiveness of retrieval in similarity searches of chemical databases: A review of performance measures [J].
Edgar, SJ ;
Holliday, JD ;
Willett, P .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2000, 18 (4-5) :343-357
[5]   Similarity searching using reduced graphs [J].
Gillet, VJ ;
Willett, P ;
Bradshaw, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (02) :338-345
[6]   Identification of biological activity profiles using substructural analysis and genetic algorithms [J].
Gillet, VJ ;
Willett, P ;
Bradshaw, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (02) :165-179
[7]   Recursive median partitioning for virtual screening of large databases [J].
Godden, JW ;
Furr, JR ;
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (01) :182-188
[8]  
GODDEN JW, 2000, PAC S BIOCOMPUT, V5, P566
[9]  
Holliday JD, 2002, COMB CHEM HIGH T SCR, V5, P155
[10]  
James CA, 1995, Daylight theory manual. daylight chemical information systems