A geodesic framework for analyzing molecular similarities

被引:25
作者
Agrafiotis, DK [1 ]
Xu, HF [1 ]
机构
[1] 3 Dimens Pharmaceut Inc, Exton, PA 19341 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2003年 / 43卷 / 02期
关键词
D O I
10.1021/ci025631m
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
A fast self-organizing algorithm for extracting the minimum number of independent variables that can fully describe a set of observations was recently described (Agrafiotis, D. K.; Xu, H. Proc. Nad. Acad. Sci. U.S.A. 2002, 99, 15869-15872). The method, called stochastic proximity embedding (SPE), attempts to generate low-dimensional Euclidean maps that best preserve the similarities between a set of related objects. Unlike conventional multidimensional scaling (MDS) and nonlinear mapping (NLM), SPE preserves only local relationships and, by doing so, reveals the intrinsic dimensionality and metric structure of the data. Its success depends critically on the choice of the neighborhood radius, which should be consistent with the local curvature of the underlying manifold. Here, we describe a procedure for determining that radius by examining the tradeoff between the stress function and the number of connected components in the neighborhood graph and show that it can be used to produce meaningful maps in any embedding dimension. The power of the algorithm is illustrated in two major areas of computational drug design: conformational analysis and diversity profiling of large chemical libraries.
引用
收藏
页码:475 / 484
页数:10
相关论文
共 19 条
[1]   Combinatorial informatics in the post-genomics era [J].
Agrafiotis, DK ;
Lobanov, VS ;
Salemme, FR .
NATURE REVIEWS DRUG DISCOVERY, 2002, 1 (05) :337-346
[2]   A self-organizing principle for learning nonlinear manifolds [J].
Agrafiotis, DK ;
Xu, HF .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (25) :15869-15872
[3]  
[Anonymous], 1961, Adaptive Control Processes: a Guided Tour, DOI DOI 10.1515/9781400874668
[4]  
Borg I., 1997, MODERN MULTIDIMENSIO
[5]   Use of structure Activity data to compare structure-based clustering methods and descriptors for use in compound selection [J].
Brown, RD ;
Martin, YC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (03) :572-584
[6]   Visualizing substructural fingerprints [J].
Clark, RD ;
Patterson, DE ;
Soltanshahi, F ;
Blake, JF ;
Matthew, JB .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2000, 18 (4-5) :404-+
[7]  
Cormen T. H., 1990, INTRO ALGORITHMS
[8]  
Crippen G. M., 1988, DISTANCE GEOMETRY MO
[9]  
Hall L. H., 1991, Reviews in Computational Chemistry, P367, DOI [10.1002/9780470125793.ch9, DOI 10.1002/9780470125793.CH9]
[10]  
Johnson M., 1990, CONCEPTS APPL MOL SI