Statistical active learning in multilayer perceptrons

被引:61
作者
Fukumizu, K
机构
[1] Brain Science Institute, RIKEN, Saitama
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2000年 / 11卷 / 01期
关键词
active learning; Fisher information matrix; multilayer perceptron; pruning;
D O I
10.1109/72.822506
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes new methods for generating input locations actively in gathering training data, aiming at solving problems unique to multilayer perceptrons. One of the problems is that optimum input locations, which are calculated deterministically, sometimes distribute densely around the same point and cause local minima in backpropagation training, Two probabilistic active learning methods, which utilize the statistical variance of locations, ale proposed to solve this problem. One is parametric active learning and the other is multipoint-search active learning. Another serious problem in applying active learning to multilayer perceptrons is that a Fisher information matrix can be singular, while many methods, including the proposed ones, assume its regularity, A technique of pruning redundant hidden units is proposed to keep the Fisher information matrix regular. Combined with this technique, active learning can be applied stably to multilayer perceptrons, The effectiveness of the proposed methods is demonstrated through computer simulations on simple artificial problems and a real-world problem of color conversion.
引用
收藏
页码:17 / 26
页数:10
相关论文
共 25 条
[21]   QUERY CONSTRUCTION, ENTROPY, AND GENERALIZATION IN NEURAL-NETWORK MODELS [J].
SOLLICH, P .
PHYSICAL REVIEW E, 1994, 49 (05) :4637-4651
[22]   CROSS-VALIDATORY CHOICE AND ASSESSMENT OF STATISTICAL PREDICTIONS [J].
STONE, M .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1974, 36 (02) :111-147
[23]   PROBABILISTIC DESIGN OF LAYERED NEURAL NETWORKS BASED ON THEIR UNIFIED FRAMEWORK [J].
WATANABE, S ;
FUKUMIZU, K .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (03) :691-702
[24]   Learning in Artificial Neural Networks: A Statistical Perspective [J].
White, Halbert .
NEURAL COMPUTATION, 1989, 1 (04) :425-464
[25]  
Yule J. A., 1967, PRINCIPLES COLOR REP