The reduced graph descriptor in virtual screening and data-driven clustering of high-throughput screening data

被引:70
作者
Harper, G [1 ]
Bravi, GS [1 ]
Pickett, SD [1 ]
Hussain, J [1 ]
Green, DVS [1 ]
机构
[1] GlaxoSmithKline, Stevenage SG1 2NY, Herts, England
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2004年 / 44卷 / 06期
关键词
D O I
10.1021/ci049860f
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Virtual screening and high-throughput screening are two major components of lead discovery within the pharmaceutical industry. In this paper we describe improvements to previously published methods for similarity searching with reduced graphs, with a particular focus on ligand-based virtual screening, and describe a novel use of reduced graphs in the clustering of high-throughput screening data. Literature methods for reduced graph similarity searching encode the reduced graphs as binary fingerprints, which has a number of issues. In this paper we extend the definition of the reduced graph to include positively and negatively ionizable groups and introduce a new method for measuring the similarity of reduced graphs based on a weighted edit distance. Moving beyond simple similarity searching, we show how more flexible queries can be built using reduced graphs and describe a database system that allows iterative querying with multiple representations. Reduced graphs capture many important features of ligand-receptor interactions and, in conjunction with other whole molecule descriptors, provide an informative way to review HTS data. We describe a novel use of reduced graphs in this context, introducing a method we have termed data-driven clustering. that identifies clusters of molecules represented by a particular whole molecule descriptor and enriched in active compounds.
引用
收藏
页码:2145 / 2156
页数:12
相关论文
共 29 条
[1]   Further development of reduced graphs for identifying bioactive compounds [J].
Barker, EJ ;
Gardiner, EJ ;
Gillet, VJ ;
Kitts, P ;
Morris, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (02) :346-356
[2]  
Barth F., 1994, [No title captured], Patent No. [EP0658546, 0658546]
[3]  
BAYER PHARMACEUTICAL, 2003, Patent No. 0340107
[4]   The properties of known drugs .1. Molecular frameworks [J].
Bemis, GW ;
Murcko, MA .
JOURNAL OF MEDICINAL CHEMISTRY, 1996, 39 (15) :2887-2893
[5]   The information content of 2D and 3D structural descriptors relevant to ligand-receptor binding [J].
Brown, RD ;
Martin, YC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (01) :1-9
[6]   Use of structure Activity data to compare structure-based clustering methods and descriptors for use in compound selection [J].
Brown, RD ;
Martin, YC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (03) :572-584
[7]  
FINKE PE, 2003, Patent No. 03007887
[8]   SIMILARITY SEARCHING ON CAS REGISTRY SUBSTANCES .2. 2D STRUCTURAL SIMILARITY [J].
FISANICK, W ;
LIPKUS, AH ;
RUSINKO, A .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (01) :130-140
[9]   Similarity searching using reduced graphs [J].
Gillet, VJ ;
Willett, P ;
Bradshaw, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (02) :338-345
[10]   COMPUTER-STORAGE AND RETRIEVAL OF GENERIC CHEMICAL STRUCTURES IN PATENTS .13. REDUCED GRAPH GENERATION [J].
GILLET, VJ ;
DOWNS, GM ;
HOLLIDAY, JD ;
LYNCH, MF ;
DETHLEFSEN, W .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1991, 31 (02) :260-270