SIMILARITY SEARCHING IN DATABASES OF 3-DIMENSIONAL MOLECULES AND MACROMOLECULES

被引:39
作者
ARTYMIUK, PJ
BATH, PA
GRINDLEY, HM
PEPPERRELL, CA
POIRRETTE, AR
RICE, DW
THORNER, DA
WILD, DJ
WILLETT, P
ALLEN, FH
TAYLOR, R
机构
[1] UNIV SHEFFIELD,KREBS INST BIOMOLEC RES,DEPT INFORMAT STUDIES,WESTERN BANK,SHEFFIELD S10 2TN,S YORKSHIRE,ENGLAND
[2] UNIV SHEFFIELD,KREBS INST BIOMOLEC RES,DEPT MOLEC BIOL & BIOTECHNOL,SHEFFIELD S10 2TN,S YORKSHIRE,ENGLAND
[3] CAMBRIDGE CRYSTALLOG DATA CTR,CAMBRIDGE CB2 1EZ,ENGLAND
[4] ICI PLC,AGROCHEM,BRACKNELL RG12 6EY,BERKS,ENGLAND
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 1992年 / 32卷 / 06期
关键词
D O I
10.1021/ci00010a007
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper discusses algorithmic techniques for measuring the degree of similarity between pairs of three-dimensional (3-D) chemical molecules represented by interatomic distance matrices. A comparison of four methods for the calculation of 3-D structural similarity suggests that the most effective one is a procedure that identifies pairs of atoms, one from each of the molecules that are being compared, that lie at the center of geometrically-related volumes of 3-D space. This atom mapping method enables the calculation of a wide range of types of intermolecular similarity coefficient, including measures that are based on physicochemical data. Massively-parallel implementations of the method are discussed, using the AMT Distributed Array Processor, that achieve a substantial increase in performance when compared with a sequential implementation on a UNIX workstation. Current work involves the use of angular information and the extension of the method to field-based similarity searching. Similarity searching in 3-D macromolecules is effected by the use of a maximal common subgraph (MCS) isomorphism algorithm with a novel, graph-based representation of the tertiary structures of proteins. This algorithm is being used to identify similarities between the 3-D structures of proteins in the Brookhaven Protein Data Bank; its use is exemplified by searches involving the NAD-binding fold motif.
引用
收藏
页码:617 / 630
页数:14
相关论文
共 54 条
[1]  
ABOLA EE, 1987, CRYSTALLOGRAPHIC DAT, P107
[2]   STRUCTURAL RESEMBLANCE BETWEEN THE FAMILIES OF BACTERIAL SIGNAL-TRANSDUCTION PROTEINS AND OF G-PROTEINS REVEALED BY GRAPH THEORETICAL TECHNIQUES [J].
ARTYMIUK, PJ ;
RICE, DW ;
MITCHELL, EM ;
WILLETT, P .
PROTEIN ENGINEERING, 1990, 4 (01) :39-43
[3]   SEARCHING TECHNIQUES FOR DATABASES OF PROTEIN SECONDARY STRUCTURES [J].
ARTYMIUK, PJ ;
RICE, DW ;
MITCHELL, EM ;
WILLETT, P .
JOURNAL OF INFORMATION SCIENCE, 1989, 15 (4-5) :287-298
[4]   3-DIMENSIONAL STRUCTURAL RESEMBLANCE BETWEEN LEUCINE AMINOPEPTIDASE AND CARBOXYPEPTIDASE-A REVEALED BY GRAPH-THEORETICAL TECHNIQUES [J].
ARTYMIUK, PJ ;
GRINDLEY, HM ;
PARK, JE ;
RICE, DW ;
WILLETT, P .
FEBS LETTERS, 1992, 303 (01) :48-52
[5]  
ARTYMIUK PJ, 1991, RECENT ADV CHEM INFO, P91
[6]  
ASH JE, 1991, CHEM STRUCTURE SYSTE
[7]   RECENT PROGRESS ON THE STRUCTURE AND FUNCTION OF GLUTAMATE-DEHYDROGENASE [J].
BAKER, PJ ;
FARRANTS, GW ;
RICE, DW ;
STILLMAN, TJ .
BIOCHEMICAL SOCIETY TRANSACTIONS, 1987, 15 (04) :748-751
[8]  
Barrow H. G., 1976, Information Processing Letters, V4, P83, DOI 10.1016/0020-0190(76)90049-1
[9]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[10]  
BIRKTOFT JJ, 1984, PEPTIDE PROTEIN REV, P61