Ultrafast shape recognition for similarity search in molecular databases

被引:57
作者
Ballester, Pedro J. [1 ]
Richards, W. Graham [1 ]
机构
[1] Univ Oxford, Phys & Theoret Chem Lab, Oxford OX1 3QZ, England
来源
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES | 2007年 / 463卷 / 2081期
关键词
molecular shape comparison; similarity search; pattern recognition; data explosion; virtual screening;
D O I
10.1098/rspa.2007.1823
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Molecular databases are routinely screened for compounds that most closely resemble a molecule of known biological activity to provide,novel drug leads. It is widely believed that three-dimensional molecular shape is the most discriminating pattern for biological activity as it is directly related to the steep repulsive part of the interaction potential between the drug-like molecule and its macromolecular target. However, efficient comparison of molecular shape is currently a challenge. Here, we show that a new approach based on moments of distance distributions is able to recognize molecular shape at least three orders of magnitude faster than current methodologies. Such an ultrafast method permits the identification of similarly shaped compounds within the largest molecular databases. In addition, the problematic requirement of aligning molecules for comparison is circumvented, as the proposed distributions are independent of molecular orientation. Our methodology could be also adapted to tackle similar hard problems in other fields, such as designing content-based Internet search engines for three-dimensional geometrical objects or performing fast similarity comparisons between proteins. From a broader perspective, we anticipate that ultrafast pattern recognition will soon become not only useful, but also essential to address the data explosion currently experienced in most scientific disciplines.
引用
收藏
页码:1307 / 1321
页数:15
相关论文
共 23 条
[11]   A 3D similarity method for scaffold hopping from the known drugs or natural ligands to new chemotypes [J].
Jenkins, JL ;
Glick, M ;
Davies, JW .
JOURNAL OF MEDICINAL CHEMISTRY, 2004, 47 (25) :6144-6159
[12]   Innovation and greater probability of success in drug discovery and development - from target to biomarkers [J].
Kola, I ;
Hazuda, D .
CURRENT OPINION IN BIOTECHNOLOGY, 2005, 16 (06) :644-646
[13]   Rapid evaluation of molecular shape similarity index using pairwise calculation of the nearest atomic distances [J].
Kotani, T ;
Higashiura, K .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (01) :58-63
[14]   New 4-point pharmacophore method for molecular similarity and diversity applications: Overview of the method and applications, including a novel approach to the design of combinatorial libraries containing privileged substructures [J].
Mason, JS ;
Morize, I ;
Menard, PR ;
Cheney, DL ;
Hulme, C ;
Labaudiniere, RF .
JOURNAL OF MEDICINAL CHEMISTRY, 1999, 42 (17) :3251-3264
[15]   Exceeding human limits [J].
Muggleton, SH .
NATURE, 2006, 440 (7083) :409-410
[16]   NEW METHOD FOR RAPID CHARACTERIZATION OF MOLECULAR SHAPES - APPLICATIONS IN DRUG DESIGN [J].
NILAKANTAN, R ;
BAUMAN, N ;
VENKATARAGHAVAN, R .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1993, 33 (01) :79-85
[17]   Virtual screening using grid computing: the screensaver project [J].
Richards, WG .
NATURE REVIEWS DRUG DISCOVERY, 2002, 1 (07) :551-555
[18]   A shape-based 3-D scaffold hopping method and its application to a bacterial protein-protein interaction [J].
Rush, TS ;
Grant, JA ;
Mosyak, L ;
Nicholls, A .
JOURNAL OF MEDICINAL CHEMISTRY, 2005, 48 (05) :1489-1495
[19]   Computational chemistry-driven decision making in lead generation [J].
Schnecke, V ;
Boström, J .
DRUG DISCOVERY TODAY, 2006, 11 (1-2) :43-50
[20]   Science in an exponential world [J].
Szalay, A ;
Gray, J .
NATURE, 2006, 440 (7083) :413-414