Neural Network Classifiers Estimate Bayesian a posteriori Probabilities

被引:654
作者
Richard, Michael D. [1 ]
Lippmann, Richard P. [1 ]
机构
[1] MIT, Lincoln Lab, Room B-349, Lexington, MA 02173 USA
关键词
D O I
10.1162/neco.1991.3.4.461
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many neural network classifiers provide outputs which estimate Bayesian a posteriori probabilities. When the estimation is accurate, network outputs can be treated as probabilities and sum to one. Simple proofs show that Bayesian probabilities are estimated when desired network outputs are 2 of M (one output unity, all others zero) and a squared-error or cross-entropy cost function is used. Results of Monte Carlo simulations performed using multilayer perceptron (MLP) networks trained with backpropagation, radial basis function (RBF) networks, and high-order polynomial networks graphically demonstrate that network outputs provide good estimates of Bayesian probabilities. Estimation accuracy depends on network complexity, the amount of training data, and the degree to which training data reflect true likelihood distributions and a priori class probabilities. Interpretation of network outputs as Bayesian probabilities allows outputs from multiple networks to be combined for higher level decision making, simplifies creation of rejection thresholds, makes it possible to compensate for differences between pattern class probabilities in training and test data, allows outputs to be used to minimize alternative risk functions, and suggests alternative measures of network performance.
引用
收藏
页码:461 / 483
页数:23
相关论文
共 26 条
[1]  
[Anonymous], 1989, NIPS 1989
[2]  
Barron A. R., 1984, SELF ORGANIZING METH, P25
[3]  
Baum E. B., 1988, NEURAL INFORMATION P, V12, P52
[4]  
Bourlard H., 1989, 89033 INT COMP SCI I
[5]  
Bourlard H., 1990, ADV NEURAL INFORMATI, V2, P186
[6]  
Bridle J. S., 1990, PROC 2 INT C NEURAL, P211, DOI [10.5555/2969830, DOI 10.5555/2969830]
[7]  
Duda R. O., 1973, PATTERN CLASSIFICATI, V3
[8]  
El-Jaroudi A., 1990, P INT JOINT C NEUR N
[9]  
Fukunaga K., 1972, INTRO STAT PATTERN R
[10]  
GISH H, 1990, INT CONF ACOUST SPEE, P1361, DOI 10.1109/ICASSP.1990.115636