SIMILARITY SEARCHING IN DATABASES OF 3-DIMENSIONAL MOLECULES AND MACROMOLECULES

被引:39
作者
ARTYMIUK, PJ
BATH, PA
GRINDLEY, HM
PEPPERRELL, CA
POIRRETTE, AR
RICE, DW
THORNER, DA
WILD, DJ
WILLETT, P
ALLEN, FH
TAYLOR, R
机构
[1] UNIV SHEFFIELD,KREBS INST BIOMOLEC RES,DEPT INFORMAT STUDIES,WESTERN BANK,SHEFFIELD S10 2TN,S YORKSHIRE,ENGLAND
[2] UNIV SHEFFIELD,KREBS INST BIOMOLEC RES,DEPT MOLEC BIOL & BIOTECHNOL,SHEFFIELD S10 2TN,S YORKSHIRE,ENGLAND
[3] CAMBRIDGE CRYSTALLOG DATA CTR,CAMBRIDGE CB2 1EZ,ENGLAND
[4] ICI PLC,AGROCHEM,BRACKNELL RG12 6EY,BERKS,ENGLAND
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 1992年 / 32卷 / 06期
关键词
D O I
10.1021/ci00010a007
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper discusses algorithmic techniques for measuring the degree of similarity between pairs of three-dimensional (3-D) chemical molecules represented by interatomic distance matrices. A comparison of four methods for the calculation of 3-D structural similarity suggests that the most effective one is a procedure that identifies pairs of atoms, one from each of the molecules that are being compared, that lie at the center of geometrically-related volumes of 3-D space. This atom mapping method enables the calculation of a wide range of types of intermolecular similarity coefficient, including measures that are based on physicochemical data. Massively-parallel implementations of the method are discussed, using the AMT Distributed Array Processor, that achieve a substantial increase in performance when compared with a sequential implementation on a UNIX workstation. Current work involves the use of angular information and the extension of the method to field-based similarity searching. Similarity searching in 3-D macromolecules is effected by the use of a maximal common subgraph (MCS) isomorphism algorithm with a novel, graph-based representation of the tertiary structures of proteins. This algorithm is being used to identify similarities between the 3-D structures of proteins in the Brookhaven Protein Data Bank; its use is exemplified by searches involving the NAD-binding fold motif.
引用
收藏
页码:617 / 630
页数:14
相关论文
共 54 条
[51]  
WILLETT P, 1991, 3 DIMENSIONAL CHEM S
[52]  
WILLETT P, 1990, PARALLEL DATABASE PR
[53]  
Willett P., 1987, SIMILARITY CLUSTERIN
[54]  
[No title captured]