Computation of the physio-chemical properties and data mining of large molecular collections

被引:42
作者
Cheng, A [1 ]
Diller, DJ [1 ]
Dixon, SL [1 ]
Egan, WJ [1 ]
Lauri, G [1 ]
Merz, KM [1 ]
机构
[1] Pharmacopeia Inc, Princeton, NJ 08543 USA
关键词
data mining; physico-chemical properties; large molecular collections;
D O I
10.1002/jcc.1164
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Very large data sets of molecules screened against a broad range of targets have become available due to the advent of combinatorial chemistry. This information has led to the realization that ADME (absorption, distribution, metabolism, and excretion) and toxicity issues are important to consider prior to library, synthesis. Furthermore, these large data sets provide a unique and important source of information regarding what types of molecular shapes may interact with specific receptor or target classes. Thus, the requirement for rapid and accurate data mining tools became paramount. To address these issues Pharmacopeia, Inc. formed a computational research group, The Center for Informatics and Drug Discovery (CIDD)*. In this review we cover the work done by this group to address both in silico ADME modeling and data mining issues faced by Pharmacopeia because of the availability of a large and diverse collection (over 6 million discrete compounds) of drug-like molecules. In particular, in the data mining arena we discuss rapid docking tools and how we employ them, and we describe a novel data mining tool based on a 1D representation of a molecule followed by a molecular sequence alignment step. For the ADME area we discuss the development and application of absorption, blood-brain barrier (BBB) and solubility models. Finally, we summarize the impact the tools and approaches might have on the drug discovery process. (C) 2002 John Wiley & Sons, Inc.
引用
收藏
页码:172 / 183
页数:12
相关论文
共 80 条
[1]   HYDROGEN-BONDING .33. FACTORS THAT INFLUENCE THE DISTRIBUTION OF SOLUTES BETWEEN BLOOD AND BRAIN [J].
ABRAHAM, MH ;
CHADHA, HS ;
MITCHELL, RC .
JOURNAL OF PHARMACEUTICAL SCIENCES, 1994, 83 (09) :1257-1268
[2]   Can we learn to distinguish between "drug-like" and "nondrug-like" molecules? [J].
Ajay ;
Walters, WP ;
Murcko, MA .
JOURNAL OF MEDICINAL CHEMISTRY, 1998, 41 (18) :3314-3324
[3]  
ARLINGTON S, 2000, PHARM EXEC JAN, P74
[4]   CRYSTAL-STRUCTURES OF NATIVE AND INHIBITED FORMS OF HUMAN CATHEPSIN-D - IMPLICATIONS FOR LYSOSOMAL TARGETING AND DRUG DESIGN [J].
BALDWIN, ET ;
BHAT, TN ;
GULNIK, S ;
HOSUR, MV ;
SOWDER, RC ;
CACHAU, RE ;
COLLINS, J ;
SILVA, AM ;
ERICKSON, JW .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1993, 90 (14) :6796-6800
[5]   Crystal structure of the novel aspartic proteinase zymogen proplasmepsin II from Plasmodium falciparum [J].
Bernstein N.K. ;
Cherney M.M. ;
Loetscher H. ;
Ridley R.G. ;
James M.N.G. .
Nature Structural Biology, 1999, 6 (1) :32-37
[6]  
Bohacek RS, 1996, MED RES REV, V16, P3, DOI 10.1002/(SICI)1098-1128(199601)16:1<3::AID-MED1>3.3.CO
[7]  
2-D
[8]   LUDI - RULE-BASED AUTOMATIC DESIGN OF NEW SUBSTITUENTS FOR ENZYME-INHIBITOR LEADS [J].
BOHM, HJ .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 1992, 6 (06) :593-606
[9]  
Borg I., 1997, MODERN MULTIDIMENSIO
[10]   Use of structure Activity data to compare structure-based clustering methods and descriptors for use in compound selection [J].
Brown, RD ;
Martin, YC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (03) :572-584