Nonlinear mapping networks

被引:46
作者
Agrafiotis, DK [1 ]
Lobanov, VS [1 ]
机构
[1] 3 Dimens Pharmaceut Inc, Exton, PA 19341 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2000年 / 40卷 / 06期
关键词
D O I
10.1021/ci000033y
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Among the many dimensionality reduction techniques that have appeared in the statistical literature, multidimensional scaling and nonlinear mapping are unique for their conceptual simplicity and ability to reproduce the topology and structure of the data space in a faithful and unbiased manner. However, a major shortcoming of these methods is their quadratic dependence on the number of objects scaled, which imposes severe limitations on the size of data sets that can be effectively manipulated. Here we describe a novel approach that combines conventional nonlinear mapping techniques with feed-forward neural networks, and allows the processing of data sets orders of magnitude larger than those accessible with conventional methodologies. Rooted on the principle of probability sampling, the method employs a classical algorithm to project a small random sample, and then "learns" the underlying nonlinear transform using a multilayer neural network trained with the back-propagation algorithm. Once trained, the neural network can be used in a feed-forward manner to project the remaining members of the population as well as new, unseen samples with minimal distortion. Using examples from the fields of image processing and combinatorial chemistry, we demonstrate that this method can generate projections that are virtually indistinguishable from those derived by conventional approaches. The ability to encode the nonlinear transform in the form of a neural network makes nonlinear mapping applicable to a wide variety of data mining applications involving very large data sets that are otherwise computationally intractable.
引用
收藏
页码:1356 / 1362
页数:7
相关论文
共 25 条
[1]  
Agrafiotis D. K., 1998, ENCY COMPUTATIONAL C, V1, P742
[2]  
Agrafiotis DK, 1997, PROTEIN SCI, V6, P287
[3]   Advances in diversity profiling and combinatorial series design [J].
Agrafiotis, DK ;
Myslik, JC ;
Salemme, FR .
MOLECULAR DIVERSITY, 1998, 4 (01) :1-22
[4]  
AGRAFIOTIS DK, 1999, ANN REPORTS COMBINAT, V2, P71
[5]  
AGRAFIOTIS DK, IN PRESS J COMP CHEM
[6]  
[Anonymous], 1961, Adaptive Control Processes: a Guided Tour, DOI DOI 10.1515/9781400874668
[7]  
[Anonymous], 1952, Psychometrika
[8]  
[Anonymous], 2001, NEURAL NETWORKS COMP
[9]   EVALUATION OF PROJECTION ALGORITHMS [J].
BISWAS, G ;
JAIN, AK ;
DUBES, RC .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1981, 3 (06) :701-708
[10]  
BLAKE C, 1998, UCI RESPOSITORY MACH