LEARNING FACTORIAL CODES BY PREDICTABILITY MINIMIZATION

被引:150
作者
SCHMIDHUBER, J
机构
关键词
D O I
10.1162/neco.1992.4.6.863
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
I propose a novel general principle for unsupervised learning of distributed nonredundant internal representations of input patterns. The principle is based on two opposing forces. For each representational unit there is an adaptive predictor which tries to predict the unit from the remaining units. In turn, each unit tries to react to the environment such that it minimizes its predictability. This encourages each unit to filter "abstract concepts" out of the environmental input such that these concepts are statistically independent of those on which the other units focus. I discuss various simple yet potentially powerful implementations of the principle that aim at finding binary factorial codes (Barlow et al. 1989), i.e., codes where the probability of the occurrence of a particular input is simply the product of the probabilities of the corresponding code symbols. Such codes are potentially relevant for (1) segmentation tasks, (2) speeding up supervised learning, and (3) novelty detection. Methods for finding factorial codes automatically implement Occam's razor for finding codes using a minimal number of units. Unlike previous methods the novel principle has a potential for removing not only linear but also nonlinear output redundancy. Illustrative experiments show that algorithms based on the principle of predictability minimization are practically feasible. The final part of this paper describes an entirely local algorithm that has a potential for learning unique representations of extended input sequences.
引用
收藏
页码:863 / 879
页数:17
相关论文
共 16 条
[1]   Finding Minimum Entropy Codes [J].
Barlow, H. B. ;
Kaushal, T. P. ;
Mitchison, G. J. .
NEURAL COMPUTATION, 1989, 1 (03) :412-423
[2]  
Becker S., 1991, International Journal of Neural Systems, V2, P17, DOI 10.1142/S0129065791000030
[3]   FORMING SPARSE REPRESENTATIONS BY LOCAL ANTI-HEBBIAN LEARNING [J].
FOLDIAK, P .
BIOLOGICAL CYBERNETICS, 1990, 64 (02) :165-170
[4]   SELF-ORGANIZATION IN A PERCEPTUAL NETWORK [J].
LINSKER, R .
COMPUTER, 1988, 21 (03) :105-117
[5]  
Oja E., 1989, International Journal of Neural Systems, V1, P61, DOI 10.1142/S0129065789000475
[6]  
PEARLMUTTER BA, 1986, NEURAL NETWORKS COMP, V2, P333
[7]  
PRELINGER D, 1992, THESIS TU MUNCHEN
[8]   DEVELOPMENT OF FEATURE-DETECTORS BY SELF-ORGANIZATION - A NETWORK MODEL [J].
RUBNER, J ;
SCHULTEN, K .
BIOLOGICAL CYBERNETICS, 1990, 62 (03) :193-199
[9]  
Sanger T. D., 1989, ADV NEURAL INFORMATI, V1, P11
[10]  
SCHMIDHUBER J, 1992, ADV NEUR IN, V4, P291