Feature trees: A new molecular similarity measure based on tree matching

Cited by: 230
Authors
Rarey, M
Dixon, JS
Affiliations
[1] SmithKline Beecham Pharmaceut, Phys & Struct Chem, King Of Prussia, PA 19406 USA
[2] German Natl Res Ctr Informat Technol, GMD, Inst Algorithms & Sci Comp, SCAI, D-53754 St Augustin, Germany
Keywords
database screening; molecular descriptors; molecular similarity; molecular superposition; structural alignment
DOI
10.1023/A:1008068904628
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
In this paper we present a new method for evaluating molecular similarity between small organic compounds. Instead of a linear representation like fingerprints, a more complex description, a feature tree, is calculated for a molecule. A feature tree represents hydrophobic fragments and functional groups of the molecule and the way these groups are linked together. Each node in the tree is labeled with a set of features representing chemical properties of the part of the molecule corresponding to the node. The comparison of feature trees is based on matching subtrees of two feature trees onto each other. Two algorithms for tackling the matching problem are described throughout this paper. On a dataset of about 1000 molecules, we demonstrate the ability of our approach to identify molecules belonging to the same class of inhibitors. With a second dataset of 58 molecules with known binding modes taken from the Brookhaven Protein Data Bank, we show that the matchings produced by our algorithms are compatible with the relative orientation of the molecules in the active site in 61% of the test cases. The average computation time for a pair comparison is about 50 ms on a current workstation.
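The data structure the abstract describes (nodes labeled with feature sets, linked as a tree, compared by matching subtrees) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the feature names (`hydrophobic`, `donor`, `acceptor`), the min/max node score, and the brute-force child assignment are hypothetical and are not the two matching algorithms of the paper.

```python
from dataclasses import dataclass, field
from itertools import permutations
from typing import Dict, List


@dataclass
class Node:
    """One feature-tree node: a fragment or functional group of the molecule."""
    features: Dict[str, float]  # hypothetical feature names and values
    children: List["Node"] = field(default_factory=list)


def node_similarity(a: Node, b: Node) -> float:
    """Ratio of summed minima to summed maxima over all feature values (1.0 if identical)."""
    keys = set(a.features) | set(b.features)
    num = sum(min(a.features.get(k, 0.0), b.features.get(k, 0.0)) for k in keys)
    den = sum(max(a.features.get(k, 0.0), b.features.get(k, 0.0)) for k in keys)
    return num / den if den else 1.0


def size(n: Node) -> int:
    """Number of nodes in the subtree rooted at n."""
    return 1 + sum(size(c) for c in n.children)


def match_score(a: Node, b: Node) -> float:
    """Best total node similarity when a is matched to b and children are paired one-to-one."""
    score = node_similarity(a, b)
    small, large = sorted((a.children, b.children), key=len)
    if not small:
        return score
    # Brute force over child assignments: fine for tiny trees, factorial in general.
    best = max(
        sum(match_score(s, l) for s, l in zip(small, perm))
        for perm in permutations(large, len(small))
    )
    return score + best


def tree_similarity(a: Node, b: Node) -> float:
    """Matched similarity normalised by the larger tree, so unmatched nodes lower the score."""
    return match_score(a, b) / max(size(a), size(b))


# Two toy trees: a hydrophobic core with one or two polar substituents.
tree_a = Node({"hydrophobic": 6.0},
              [Node({"donor": 1.0, "acceptor": 1.0}), Node({"acceptor": 2.0})])
tree_b = Node({"hydrophobic": 6.0}, [Node({"acceptor": 2.0})])
print(tree_similarity(tree_a, tree_b))
```

In this sketch a tree compared with itself scores 1.0, and the score drops when one tree has substituents that the other lacks. Rooting the trees and matching root to root is a simplification chosen to keep the example short.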
Pages: 471-490
Page count: 20