ASYMPTOTICALLY OPTIMAL DISCRIMINANT FUNCTIONS FOR PATTERN CLASSIFICATION

被引:102
作者
WOLVERTON, CT
WAGNER, TJ
机构
[1] Mitre Corporation, Bedford, Mass.
[2] Electronics Research Center, Department of Electrical Engineering, University of Texas, Austin
关键词
D O I
10.1109/TIT.1969.1054295
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The two category classification problem is treated. No a priori knowledge of the statistics of the classes is assumed. A sequence of labeled samples from the two classes is used to construct a sequence of approximations of a discriminant function that is optimum in the sense of minimizing the probability of misclassification but which requires knowledge of all the statistics of the classes. Depending on the assumptions made about the probability densities corresponding to the two classes, the integrated square error of the approximations converges to 0 in probability or with probability 1. The approximations are nonparametric and recursive for each fixed point of the domain. Rates of convergence are given. The approximations are used to define a decision procedure for classifying unlabeled samples. It is shown that as the number of labeled samples used to construct the approximations increases, the resulting sequence of discriminant functions is asymptotically optimal in the sense that the probability of misclassification when using the approximations in the decision procedure converges in probability or with probability 1, depending on the assumptions made, to the probability of misclassification of the optimum discriminant function. The results can be easily extended to the multicategory problem and to the case of arbitrary loss functions, that is, where the costs of misclassification are not necessarily equal to 1. © 1969 IEEE. All rights reserved.
引用
收藏
页码:258 / +
页数:1
相关论文
共 13 条
[1]  
AIZERMAN MA, 1965, AUTOMAT REM CONTR+, V25, P1175
[2]  
BRAVERMAN EM, 1966, AUTOMAT REM CONTR+, V27, P80
[3]  
DVORETZKY A, 1956, 3 P BERK S MATH STAT, V1, P39
[4]  
FU KS, 1966, TREE666 PURD U SCH E
[5]  
Halmos P.R., 1950, MEASURE THEORY
[6]  
HO YC, 1966 P NEC
[7]   ON NON-PARAMETRIC ESTIMATES OF DENSITY FUNCTIONS AND REGRESSION CURVES [J].
NADARAYA, EA .
THEORY OF PROBILITY AND ITS APPLICATIONS,USSR, 1965, 10 (01) :186-&
[8]   ESTIMATION OF A PROBABILITY DENSITY-FUNCTION AND MODE [J].
PARZEN, E .
ANNALS OF MATHEMATICAL STATISTICS, 1962, 33 (03) :1065-&
[9]   A MEAN-SQUARE PERFORMANCE CRITERION FOR ADAPTIVE PATTERN CLASSIFICATION SYSTEMS [J].
PATTERSO.JD ;
WAGNER, TJ ;
WOMACK, BF .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1967, AC12 (02) :195-+
[10]  
SCHWARTZ SC, 1967, 5 P ALL C CIRC SYST