Effectiveness of graph-based and fingerprint-based similarity measures for virtual screening of 2D chemical structure databases

被引:94
作者
Raymond, JW
Willett, P
机构
[1] Univ Sheffield, Krebs Inst Biomolec Res, Sheffield S10 2TN, S Yorkshire, England
[2] Univ Sheffield, Dept Informat Studies, Sheffield S10 2TN, S Yorkshire, England
[3] Pfizer Global Res & Dev, Ann Arbor Labs, Ann Arbor, MI 48105 USA
关键词
fingerprint; graph matching; maximum common edge subgraph; maximum overlapping set; RASCAL; similarity coefficient; similarity searching; virtual screening;
D O I
10.1023/A:1016387816342
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
This paper reports an evaluation of both graph-based and fingerprint-based measures of structural similarity, when used for virtual screening of sets of 2D molecules drawn from the MDDR and ID Alert databases. The graph-based measures employ a new maximum common edge subgraph isomorphism algorithm, called RASCAL, with several similarity coefficients described previously for quantifying the similarity between pairs of graphs. The effectiveness of these graph-based searches is compared with that resulting from similarity searches using BCI, Daylight and Unity 2D fingerprints. Our results suggest that graph-based approaches provide an effective complement to existing fingerprint-based approaches to virtual screening.
引用
收藏
页码:59 / 71
页数:13
相关论文
共 29 条
[1]  
[Anonymous], 1985, CAS PEST MAT FYS
[2]  
[Anonymous], MOL SIMILARITY DRUG
[3]  
Bohm H.J., 2000, VIRTUAL SCREENING BI
[4]   Use of structure Activity data to compare structure-based clustering methods and descriptors for use in compound selection [J].
Brown, RD ;
Martin, YC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (03) :572-584
[5]   A graph distance metric based on the maximal common subgraph [J].
Bunke, H ;
Shearer, K .
PATTERN RECOGNITION LETTERS, 1998, 19 (3-4) :255-259
[6]   On a relation between graph edit distance and maximum common subgraph [J].
Bunke, H .
PATTERN RECOGNITION LETTERS, 1997, 18 (08) :689-694
[7]  
DOWNS GM, 1995, REV COMP CH, V7, P1
[8]   Effectiveness of retrieval in similarity searches of chemical databases: A review of performance measures [J].
Edgar, SJ ;
Holliday, JD ;
Willett, P .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2000, 18 (4-5) :343-357
[9]  
Ellis D, 1993, PERSPECTIVES INFORMA, V3, P128, DOI DOI 10.1007/978-1-4471-2099-5_6
[10]   On the properties of bit string-based measures of chemical similarity [J].
Flower, DR .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (03) :379-386