Inverse Frequency Weighting of Fragments for Similarity-Based Virtual Screening

Cited by: 8
Authors
Arif, Shereena M. [1 ,2 ]
Holliday, John D. [1 ]
Willett, Peter [1 ]
Affiliations
[1] Univ Sheffield, Informat Sch, Sheffield S10 2TN, S Yorkshire, England
[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Ukm Bangi 43600, Malaysia
Keywords
CLASSIFICATION; PERFORMANCE
DOI
10.1021/ci1001235
Chinese Library Classification number
R914 [Medicinal Chemistry]
Discipline classification code
100701
Abstract
This paper discusses the weighting of two-dimensional fingerprints for similarity-based virtual screening, specifically the use of weights that assign the greatest importance to the substructural fragments that occur least frequently in the database being screened. Virtual screening experiments using the MDL Drug Data Report and World of Molecular Bioactivity databases show that such inverse frequency weighting schemes can, in some circumstances, yield marked increases in screening effectiveness compared with conventional, unweighted fingerprints. Analysis of the characteristics of the various schemes demonstrates that such weights are best used to weight the fingerprint of the reference structure in a similarity search, with the database structures' fingerprints left unweighted. However, the increases in performance resulting from such weights are observed only with structurally homogeneous sets of active molecules; when the actives are diverse, the best results are obtained using conventional, unweighted fingerprints for both the reference structure and the database structures.
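The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract names neither a weighting formula nor a similarity coefficient, so the sketch assumes a weight of log(N / n_i) for a fragment present in n_i of N database molecules, and a Tanimoto coefficient generalized to real-valued vectors. Fingerprints are modeled as plain sets of hypothetical fragment labels, and the function names are illustrative only. Following the abstract, only the reference structure is weighted; database fingerprints stay binary.

```python
import math
from collections import Counter


def inverse_frequency_weights(database):
    """Assumed scheme: weight = log(N / n_i), so rare fragments score highest."""
    n = len(database)
    counts = Counter(frag for fp in database for frag in fp)
    return {frag: math.log(n / c) for frag, c in counts.items()}


def tanimoto(reference, molecule, weights=None, unseen_weight=0.0):
    """Generalized Tanimoto between a reference and a binary database fingerprint.

    If `weights` is given, each reference fragment takes its weight
    (fragments absent from `weights` take `unseen_weight`, an assumption);
    otherwise every reference fragment counts as 1.0. Database fragments
    always count as 1.0.
    """
    if weights is None:
        x = {f: 1.0 for f in reference}
    else:
        x = {f: weights.get(f, unseen_weight) for f in reference}
    dot = sum(w for f, w in x.items() if f in molecule)
    denom = sum(w * w for w in x.values()) + len(molecule) - dot
    return dot / denom if denom else 0.0


def rank(reference, database, weights=None):
    """Return database indices sorted by decreasing similarity to the reference."""
    scores = [tanimoto(reference, fp, weights) for fp in database]
    return sorted(range(len(database)), key=lambda i: -scores[i])


# Toy data: fragment "a" occurs in every molecule, so its assumed weight is zero.
database = [{"a", "b", "c"}, {"a", "b"}, {"a", "d"}, {"a", "c", "d"}]
reference = {"a", "b"}

weights = inverse_frequency_weights(database)
print(rank(reference, database))           # conventional, unweighted search
print(rank(reference, database, weights))  # reference weighted, database binary
```

In this toy set the ubiquitous fragment "a" contributes nothing to the weighted score, so molecules that share only "a" with the reference drop to a similarity of zero. Whether such weighting helps on real data depends, as the abstract notes, on how structurally homogeneous the active molecules are.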
Pages: 1340-1349
Page count: 10