OPTIMIZED FEATURE-EXTRACTION AND THE BAYES DECISION IN FEEDFORWARD CLASSIFIER NETWORKS

被引:56
作者
LOWE, D
WEBB, AR
机构
[1] Royal Signals and Radar Establishment, Worcs, WR14 3PS, St. Andrews Road, Great Malvern
关键词
ADAPTIVE LAYERED NETWORKS; BAYES MINIMUM RISK; DISCRIMINANT ANALYSIS; LEARNING; LEAST-SQUARES OPTIMIZATION; PATTERN CLASSIFICATION; PRIOR PROBABILITIES;
D O I
10.1109/34.88570
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we address the problem of multiclass pattern classification using adaptive layered networks. We consider a special class of networks, i.e., feed-forward networks with a linear final layer, that perform generalized linear discriminant analysis. This class is sufficiently generic to encompass the behavior of arbitrary feed-forward nonlinear networks since there is no restriction on the number of nonlinear hidden layers. Additionally, the use of a final layer of linear output nodes allows a formal analysis to be made to predict the optimum network performance. Training the network consists of a least-square approach which combines a generalized inverse computation to solve for the final layer weights, together with a nonlinear optimization scheme to solve for parameters of the nonlinearities. Such an approach performs feature extraction and classification simultaneously, in which the feature extraction is (optimally) matched to the classification scheme. We derive a general analytic form for the feature extraction criterion and interpret it for specific forms of target coding and error weighting. An important aspect of the approach is to exhibit how a priori information regarding nonuniform class membership, uneven distribution between train and test sets and misclassification costs may be exploited in a regularized manner in the training phase of networks.
引用
收藏
页码:355 / 364
页数:10
相关论文
共 29 条
[1]  
[Anonymous], 1974, CLASSIFICATION ESTIM
[2]  
Asohand H., 1990, P IJCNN 90 SAN DIEG, pIII
[3]  
BOUNDS DG, 1988, P IEEE INT C NEURAL, V2, P481
[4]  
BOURLARD H, 1988, NEURAL NETWORKS, V2, P53
[5]  
Breiman L, 2017, CLASSIFICATION REGRE, P368, DOI 10.1201/9781315139470
[6]  
Broomhead D. S., 1988, COMPLEX SYSTEMS, P269
[7]  
Devijver P. A., 1973, 1st International Joint Conference on Pattern Recognition, P139
[8]  
Devijver PA, 1982, PATTERN RECOGNITION
[9]   The use of multiple measurements in taxonomic problems [J].
Fisher, RA .
ANNALS OF EUGENICS, 1936, 7 :179-188
[10]  
FUKUNAGA K, 1972, INTRO STATISTICAL PA