Multidimensional scaling of combinatorial libraries without explicit enumeration

被引:17
作者
Agrafiotis, DK [1 ]
Lobanov, VS [1 ]
机构
[1] Three Dimens Pharmaceut Inc, Exton, PA 19341 USA
关键词
data mining; pattern recognition; dimensionality reduction; feature extraction; multidimensional scaling; nonlinear mapping; Sammon mapping; neural network; multiplayer perceptron; combinatorial network; combinatorial library; combinatorial chemistry; high-throughput screening; virtual screening; molecular similarity; molecular diversity; QSAR; QSPR; molecular descriptor;
D O I
10.1002/jcc.1126
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
A novel approach for the multidimensional scaling of large combinatorial libraries is presented. The method employs a multilayer perceptron, which is trained to predict the coordinates of the products on the nonlinear map from pertinent features of their respective building blocks. TI-Lis method limits the expensive enumeration and descriptor generation to only a small fraction of products and, in addition, relieves the enormous computational effort required for the low-dimensional embedding by conventional iterative multidimensional scaling algorithms. In effect, the method provides an explicit mapping function from reagents to products, and allows the vast majority of compounds to be projected without constructing their connection tables. The advantages of this approach are demonstrated using two combinatorial libraries based on the reductive amination and Ugi reactions, and three descriptor sets that are commonly used in similarity searching, diversity profiling and structure-activity correlation. (C) 2001 John Wiley & Sons, Inc.
引用
收藏
页码:1712 / 1722
页数:11
相关论文
共 46 条
[1]  
Agrafiotis D. K., 1998, ENCY COMPUTATIONAL C, V1, P742
[2]   Stochastic algorithms for maximizing molecular diversity [J].
Agrafiotis, DK .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (05) :841-851
[3]  
Agrafiotis DK, 1997, PROTEIN SCI, V6, P287
[4]   Nonlinear mapping networks [J].
Agrafiotis, DK ;
Lobanov, VS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (06) :1356-1362
[5]   Advances in diversity profiling and combinatorial series design [J].
Agrafiotis, DK ;
Myslik, JC ;
Salemme, FR .
MOLECULAR DIVERSITY, 1998, 4 (01) :1-22
[6]  
Agrafiotis DK, 2001, J COMPUT CHEM, V22, P488, DOI 10.1002/1096-987X(20010415)22:5<488::AID-JCC1020>3.0.CO
[7]  
2-4
[8]   On the use of information theory for assessing molecular diversity [J].
Agrafiotis, DK .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (03) :576-580
[9]  
AGRAFIOTIS DK, 2000, VIRTUAL SCREENING BI, P265
[10]  
AGRAFIOTIS DK, 1997, Patent No. 5685711