Statistical active learning in multilayer perceptrons

被引:61
作者
Fukumizu, K
机构
[1] Brain Science Institute, RIKEN, Saitama
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2000年 / 11卷 / 01期
关键词
active learning; Fisher information matrix; multilayer perceptron; pruning;
D O I
10.1109/72.822506
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes new methods for generating input locations actively in gathering training data, aiming at solving problems unique to multilayer perceptrons. One of the problems is that optimum input locations, which are calculated deterministically, sometimes distribute densely around the same point and cause local minima in backpropagation training, Two probabilistic active learning methods, which utilize the statistical variance of locations, ale proposed to solve this problem. One is parametric active learning and the other is multipoint-search active learning. Another serious problem in applying active learning to multilayer perceptrons is that a Fisher information matrix can be singular, while many methods, including the proposed ones, assume its regularity, A technique of pruning redundant hidden units is proposed to keep the Fisher information matrix regular. Combined with this technique, active learning can be applied stably to multilayer perceptrons, The effectiveness of the proposed methods is demonstrated through computer simulations on simple artificial problems and a real-world problem of color conversion.
引用
收藏
页码:17 / 26
页数:10
相关论文
共 25 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
[Anonymous], 1982, ESTIMATION DEPENDENC
[3]  
Cohn D. A., 1994, ADV NEURAL INFORMATI, P679
[4]  
CRAMER H, 1946, MATH METHOD STAT, P497
[5]  
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
[6]  
Efron B., 1993, INTRO BOOTSTRAP, V1st ed., DOI DOI 10.1201/9780429246593
[7]  
Fedorov V. V., 1992, THEORY OPTIMAL EXPT
[8]  
Fukumizu K, 1996, ADV NEUR IN, V8, P295
[9]   A regularity condition of the information matrix of a multilayer perceptron network [J].
Fukumizu, K .
NEURAL NETWORKS, 1996, 9 (05) :871-879
[10]  
FUKUMIZU K, 1994, 1994 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOL 1-7, P777, DOI 10.1109/ICNN.1994.374276