ENHANCING THE DIVERSITY OF A CORPORATE DATABASE USING CHEMICAL DATABASE CLUSTERING AND ANALYSIS

被引:87
作者
SHEMETULSKIS, NE [1 ]
DUNBAR, JB [1 ]
DUNBAR, BW [1 ]
MORELAND, DW [1 ]
HUMBLET, C [1 ]
机构
[1] WARNER LAMBERT PARKE DAVIS,PARKE DAVIS PHARMACEUT RES DIV,ANN ARBOR,MI 48105
关键词
MOLECULAR DIVERSITY; STRUCTURAL DATABASES; JARVIS-PATRICK; CLUSTERING; OCTANOL-WATER PARTITION COEFFICIENT; MOLAR REFRACTIVITY; DIPOLE MOMENT;
D O I
10.1007/BF00123998
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The contribution that the Chemical Abstracts structural database (CAST-3D) and the Maybridge database (MAY) would make to diversifying the structural information and property space spanned by our corporate database (CBI) is assessed. A subset of the CAST-3D database has been selected to augment the structural diversity of various electronic databases used in computer-assisted drug design projects. The analysis of the MAY database directly offers the potential to expand the CBI compound library, but also provides a source for structural diversity in a format suitable for computer-assisted database searching and molecular design. The analysis performed is twofold. First, a nonhierarchical clustering technique available in the Daylight clustering package is applied to evaluate the structural differences between databases. The comparison is then extended to analyze various structure-derived property spaces calculated from molecular descriptors such as the logarithm of the octanol-water partition coefficient (CLOGP), the molar refractivity (CMR) and the electronic dipole moment (CDM). The diversity contribution of each database to these property spaces is quantified in relation to our corporate database.
引用
收藏
页码:407 / 416
页数:10
相关论文
共 14 条
[1]   CLUSTERING OF CHEMICAL STRUCTURES ON THE BASIS OF 2-DIMENSIONAL SIMILARITY MEASURES [J].
BARNARD, JM ;
DOWNS, GM .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1992, 32 (06) :644-649
[2]  
BROWN R, UNPUB
[3]   SIMILARITY SEARCHING AND CLUSTERING OF CHEMICAL-STRUCTURE DATABASES USING MOLECULAR PROPERTY DATA [J].
DOWNS, GM ;
WILLETT, P ;
FISANICK, W .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (05) :1094-1102
[4]   VALIDITY STUDIES IN CLUSTERING METHODOLOGIES [J].
DUBES, R ;
JAIN, AK .
PATTERN RECOGNITION, 1979, 11 (04) :235-254
[5]  
HALL LH, 1993, MOLCONN X V 2
[6]  
JAMES CA, 1995, DAYLIGHT THEORY MANU, P43
[7]   CLUSTERING USING A SIMILARITY MEASURE BASED ON SHARED NEAR NEIGHBORS [J].
JARVIS, RA ;
PATRICK, EA .
IEEE TRANSACTIONS ON COMPUTERS, 1973, C-22 (11) :1025-1034
[8]   MEASURING DIVERSITY - EXPERIMENTAL-DESIGN OF COMBINATORIAL LIBRARIES FOR DRUG DISCOVERY [J].
MARTIN, EJ ;
BLANEY, JM ;
SIANI, MA ;
SPELLMEYER, DC ;
WONG, AK ;
MOOS, WH .
JOURNAL OF MEDICINAL CHEMISTRY, 1995, 38 (09) :1431-1436
[9]  
PEARLMAN RS, 1994, CONCORD
[10]   SMILES, A CHEMICAL LANGUAGE AND INFORMATION-SYSTEM .1. INTRODUCTION TO METHODOLOGY AND ENCODING RULES [J].
WEININGER, D .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1988, 28 (01) :31-36