AN EXACTLY SOLVABLE MODEL OF UNSUPERVISED LEARNING

被引:17
作者
BIEHL, M
机构
[1] CONNECT, The Niels Bohr Institute, Copenhagen Ø, DK-2100
来源
EUROPHYSICS LETTERS | 1994年 / 25卷 / 05期
关键词
D O I
10.1209/0295-5075/25/5/014
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
A model for unsupervised learning from N-dimensional data is studied. Random training examples are drawn such that the distribution of their overlaps with a vector B is-an-element-of R(N) is a mixture of two Gaussians of unit width and a separation rho. A student vector is generated by an on-line algorithm, using each example only once. The evolution of its overlap with B can be calculated exactly in the themodynamic limit N --> infinity. As a specific example, a learning algorithm closely related to Oja's rule is investigated. Its dynamics and approach to the stationary solution are solved for both a constant and an optimally chosen time-dependent learning rate. For the latter, the limits of small and infinitely large separation rho of the peaks are considered. In both limits the analysis suggests the use of an asymptotic (1/p)-decay for the learning rate, where p is the number of training examples. In the large-separation limit, the typical number of examples needed for successful learning is found to be (p/N) is-proportional-to rho-2, which coincides with a recent result for supervised learning from Gaussian mixtures.
引用
收藏
页码:391 / 396
页数:6
相关论文
共 16 条
[1]  
[Anonymous], 1991, INTRO THEORY NEURAL, DOI DOI 10.1201/9780429499661
[2]  
BARKAI N, 1993, SCALING LAWS LEARNIN
[3]   ONLINE LEARNING OF A TIME-DEPENDENT RULE [J].
BIEHL, M ;
SCHWARZE, H .
EUROPHYSICS LETTERS, 1992, 20 (08) :733-738
[4]   STATISTICAL-MECHANICS OF UNSUPERVISED LEARNING [J].
BIEHL, M ;
MIETZNER, A .
EUROPHYSICS LETTERS, 1993, 24 (05) :421-426
[5]  
BIEHL M, 1993, STATISTICAL MECHANIC
[6]  
DARKEN C, 1992, ADV NEUR IN, V4, P1009
[7]  
GOLDSTEIN L, 1987, DRB306 U SO CAL DEP
[8]   OPTIMAL GENERALIZATION IN PERCEPTRONS [J].
KINOUCHI, O ;
CATICHA, N .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1992, 25 (23) :6243-6250
[9]   IMPROVING A NETWORK GENERALIZATION ABILITY BY SELECTING EXAMPLES [J].
KINZEL, W ;
RUJAN, P .
EUROPHYSICS LETTERS, 1990, 13 (05) :473-477
[10]   A SIMPLIFIED NEURON MODEL AS A PRINCIPAL COMPONENT ANALYZER [J].
OJA, E .
JOURNAL OF MATHEMATICAL BIOLOGY, 1982, 15 (03) :267-273