The Compressed Feature Matrix - a novel descriptor for adaptive similarity search

被引:8
作者
Abolmaali, SFB
Ostermann, C
Zell, A
机构
[1] Univ Tubingen, Dept Comp Sci, D-72076 Tubingen, Germany
[2] ALTANA Pharma AG, D-78467 Constance, Germany
关键词
similarity; descriptor; computer chemistry features; scaffold hopping;
D O I
10.1007/s00894-002-0110-0
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The Compressed Feature Matrix (CFM) is a new molecular descriptor for adaptive similarity searching. Depending on the requirements, it is based on a distance or geometry matrix. Thus, the CFM permits topological and three-dimensional comparisons of molecules. In contrast to the common distance matrix, the CFM is based on features instead of atoms. Each kind of these features may be weighted separately, depending on its (estimated) contribution to the biological effect of the molecule. In this work, we show that the CFM allows us to adapt similarity evaluations to particular ligands as well as to classification requirements. The CFM method is analyzed regarding correctness, adaptivity and speed. Applying the basic setting of feature weights, the similarity evaluations using the CFM on the one hand and the Tanimoto coefficient together with MACCS Keys on the other yield similar results. However, in contrast to the latter method, the CFM even permits us to focus on small parts of molecules to serve as a basis for similarity. Accordingly, we have achieved striking results not only by readjusting the feature weights with regard to the scaffold but also to the side chain of the respective target. The results of the latter run turned out to be rather independent of the molecular scaffold. Hence, the CFM is suitable not only for common similarity evaluation, but also for techniques such as lead or scaffold hopping.
引用
收藏
页码:66 / 75
页数:10
相关论文
共 24 条
[1]  
ABOLMAALI SFB, 2001, P 15 MOL MOD WORKSH
[2]  
[Anonymous], 2001, UNITY CHEM INF SOFTW
[3]   EFFECT OF STANDARDIZATION ON FRAGMENT-BASED MEASURES OF STRUCTURAL SIMILARITY [J].
BATH, PA ;
MORRIS, CA ;
WILLETT, P .
JOURNAL OF CHEMOMETRICS, 1993, 7 (06) :543-550
[4]   MOLECULAR-IDENTIFICATION NUMBER FOR SUBSTRUCTURE SEARCHES [J].
BURDEN, FR .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1989, 29 (03) :225-227
[5]   ATOM PAIRS AS MOLECULAR-FEATURES IN STRUCTURE ACTIVITY STUDIES - DEFINITION AND APPLICATIONS [J].
CARHART, RE ;
SMITH, DH ;
VENKATARAGHAVAN, R .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (02) :64-73
[6]  
*CHEM COMP GROUP I, 2001, MOE REL 2001 01
[7]  
*COMP CHEM CENTR, 1998, EPTRA VERS 2 6
[8]   SUBSTRUCTURAL ANALYSIS - NOVEL APPROACH TO PROBLEM OF DRUG DESIGN [J].
CRAMER, RD ;
REDL, G ;
BERKOFF, CE .
JOURNAL OF MEDICINAL CHEMISTRY, 1974, 17 (05) :533-535
[9]   DESCRIPTION OF SEVERAL CHEMICAL-STRUCTURE FILE FORMATS USED BY COMPUTER-PROGRAMS DEVELOPED AT MOLECULAR DESIGN LIMITED [J].
DALBY, A ;
NOURSE, JG ;
HOUNSHELL, WD ;
GUSHURST, AKI ;
GRIER, DL ;
LELAND, BA ;
LAUFER, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1992, 32 (03) :244-255
[10]   SIMILARITY SEARCHING ON CAS REGISTRY SUBSTANCES .1. GLOBAL MOLECULAR PROPERTY AND GENERIC ATOM TRIANGLE GEOMETRIC SEARCHING [J].
FISANICK, W ;
CROSS, KP ;
RUSINKO, A .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1992, 32 (06) :664-674