Visual Characterization and Diversity Quantification of Chemical Libraries: 1. Creation of Delimited Reference Chemical Subspaces

被引:24
作者
Le Guilloux, Vincent [1 ]
Colliandre, Lionel [1 ]
Bourg, Stephane [2 ]
Guenegou, Guillaume [1 ]
Dubois-Chevalier, Julie [1 ,3 ]
Morin-Allory, Luc [1 ]
机构
[1] Univ Orleans, CNRS, UMR 6005, ICOA, F-45067 Orleans 2, France
[2] Univ Orleans, CNRS, FR 2708, Federat Rech Phys & Chim Vivant, F-45071 Orleans 2, France
[3] Univ Orleans, LIFO, F-45067 Orleans 2, France
关键词
DIFFERENTIAL SHANNON ENTROPY; MEDICINAL CHEMISTRY SPACE; DEVELOPMENT KIT CDK; SOURCE [!text type='JAVA']JAVA[!/text] LIBRARY; DRUG DISCOVERY; MOLECULAR DESCRIPTORS; COMBINATORIAL LIBRARIES; COMPOUND DATABASES; LEAD DISCOVERY; SCREENING LIBRARIES;
D O I
10.1021/ci200051r
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
High-throughput screening (HTS) is a well-established technology which can test up to several million compounds in a few weeks. Despite these appealing capabilities, available resources and high costs may limit the number of molecules screened, making diversity analysis a method of choice to design and prioritize screening libraries. With a constantly increasing number of molecules available for screening, chemical space has become a key concept for visualizing, analyzing, and comparing chemical libraries. In this first article, we present a new method to build delimited reference chemical subspaces (DRCS). A set of 16 million screening compounds from 73 chemical providers has been gathered, resulting in a database of 6.63 million standardized and unique molecules. These molecules have been used to create three DRCS using three different sets of chemical descriptors. A robust principal component analysis model for each space has been obtained, whereby molecules are projected in a reduced two-dimensional viewable space. The specificity of our approach is that each reduced space has been delimited by a representative contour encompassing a very large proportion of molecules and reflecting its overall shape. The methodology is illustrated by mapping and comparing various chemical libraries. Several tools used in these studies are made freely available, thus enabling any user to compute DRCS matching specific requirements.
引用
收藏
页码:1762 / 1774
页数:13
相关论文
共 100 条
[1]  
Accelrys, 2010, PIP PIL
[2]  
Agrafiotis DK, 2001, J COMPUT CHEM, V22, P488, DOI 10.1002/1096-987X(20010415)22:5<488::AID-JCC1020>3.0.CO
[3]  
2-4
[4]  
*AKOS CONS SOL GMB, CMC
[5]   Toward general methods of targeted library design: Topomer shape similarity searching with diverse structures as queries [J].
Andrews, KM ;
Cramer, RD .
JOURNAL OF MEDICINAL CHEMISTRY, 2000, 43 (09) :1723-1740
[6]  
[Anonymous], 2009, MOE VERS 2009 10
[7]  
[Anonymous], 2010, INCHI 1 03
[8]   The One-Class Classification Approach to Data Description and to Models Applicability Domain [J].
Baskin, Igor I. ;
Kireeva, Natalia ;
Varnek, Alexandre .
MOLECULAR INFORMATICS, 2010, 29 (8-9) :581-587
[9]   Drug-like annotation and duplicate analysis of a 23-supplier chemical database totalling 2.7 million compounds [J].
Baurin, N ;
Baker, R ;
Richardson, C ;
Chen, I ;
Foloppe, N ;
Potter, A ;
Jordan, A ;
Roughley, S ;
Parratt, M ;
Greaney, P ;
Morley, D ;
Hubbard, RE .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (02) :643-651
[10]   970 Million Druglike Small Molecules for Virtual Screening in the Chemical Universe Database GDB-13 [J].
Blum, Lorenz C. ;
Reymond, Jean-Louis .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2009, 131 (25) :8732-+