Multilayer neural networks and Bayes decision theory

被引:52
作者
Funahashi, K [1 ]
机构
[1] Univ Aizu, Ctr Math Sci, Fukushima 965, Japan
关键词
multilayer neural network; Bayes decision theory; back-propagation algorithm; Bayes discriminant function; Gaussian probability density; A posteriori probability;
D O I
10.1016/S0893-6080(97)00120-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There are many applications of multilayer neural networks to pattern classification problems in the engineering field. Recently, it has been shown that Bayes a posteriori probability can be estimated by feedforward neural networks through computer simulation. In this paper, Bayes decision theory is combined with the approximation theory an three-layer neural networks, and the two-category n-dimensional Gaussian classification problem is studied. First, we prove theoretically that three-layer neural networks with at least 2n hidden units have the capability of approximating the a posteriori probability in the two-category classification problem with arbitrary accuracy. Second, we prove that the input-output function of neural networks with at least 2n hidden units tends to the a posteriori probability as Back-Propagation learning proceeds ideally. These results provide a theoretical basis for the study of pattern classification by computer simulation. (C) 1998 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:209 / 213
页数:5
相关论文
共 10 条
[1]   ON THE APPROXIMATE REALIZATION OF CONTINUOUS-MAPPINGS BY NEURAL NETWORKS [J].
FUNAHASHI, K .
NEURAL NETWORKS, 1989, 2 (03) :183-192
[2]  
Funahashi K., 1990, Electronics and Communications in Japan, Part 3 (Fundamental Electronic Science), V73, P61, DOI 10.1002/ecjc.4430731107
[3]  
FUNAHASHI K, 1990, I ELECTRON INFORM A, V73, P139
[4]  
Hart P.E., 1973, Pattern recognition and scene analysis
[5]   ARBITRARY NONLINEARITY IS SUFFICIENT TO REPRESENT ALL FUNCTIONS BY NEURAL NETWORKS - A THEOREM [J].
KREINOVICH, VY .
NEURAL NETWORKS, 1991, 4 (03) :381-383
[6]   Neural Network Classifiers Estimate Bayesian a posteriori Probabilities [J].
Richard, Michael D. ;
Lippmann, Richard P. .
NEURAL COMPUTATION, 1991, 3 (04) :461-483
[7]  
Ruck D W, 1990, IEEE Trans Neural Netw, V1, P296, DOI 10.1109/72.80266
[8]  
Rumelhart D. E., 1986, PARALLEL DISTRIBUTED, V1
[9]  
TODA N, 1991, P INT JOINT C NEUR N, P343
[10]   Learning in Artificial Neural Networks: A Statistical Perspective [J].
White, Halbert .
NEURAL COMPUTATION, 1989, 1 (04) :425-464