STRONG UNIVERSAL CONSISTENCY OF NEURAL-NETWORK CLASSIFIERS

被引:50
作者
FARAGO, A [1 ]
LUGOSI, G [1 ]
机构
[1] TECH UNIV BUDAPEST,FAC ELECT ENGN,DEPT MATH,H-1521 BUDAPEST,HUNGARY
关键词
PATTERN RECOGNITION; NEURAL NETWORKS; NONPARAMETRIC CLASSIFICATION; CONSISTENCY; TRAINING ALGORITHMS;
D O I
10.1109/18.243433
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In statistical pattern recognition a classifier is called universally consistent if its error probability converges to the Bayes-risk as the size of the training data grows, for all possible distributions of the random variable pair of the observation vector and its class. It is proven that if a one layered neural network with properly chosen number of nodes is trained to minimize the empirical risk on the training data, then it results in a universally consistent classifier. It is shown that the exponent in the rate of convergence does not depend on the dimension if certain smoothness conditions on the distribution are satisfied. That is, this class of universally consistent classifiers does not suffer from the ''curse of dimensionality.'' A training algorithm is also presented that finds the optimal set of parameters in polynomial time if the number of nodes and the space dimension is fixed and the amount of training data grows.
引用
收藏
页码:1146 / 1151
页数:6
相关论文
共 27 条
[1]  
BARRON AR, 1991, COMPUTATIONAL LEARNI
[2]  
BARRON AR, 1993, IN PRESS IEEE T INFO
[3]  
BARRON AR, 1991, P NATO ASI NONPARAME
[4]  
BARRON AR, 1992, YALE WORKSHOP ADAPTI
[5]   What Size Net Gives Valid Generalization? [J].
Baum, Eric B. ;
Haussler, David .
NEURAL COMPUTATION, 1989, 1 (01) :151-160
[6]  
BLUM A, 1988, 1ST P WORKSH COMP LE, P9
[7]   BACK PROPAGATION FAILS TO SEPARATE WHERE PERCEPTRONS SUCCEED [J].
BRADY, ML ;
RAGHAVAN, R ;
SLAWNY, J .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS, 1989, 36 (05) :665-674
[8]   GEOMETRICAL AND STATISTICAL PROPERTIES OF SYSTEMS OF LINEAR INEQUALITIES WITH APPLICATIONS IN PATTERN RECOGNITION [J].
COVER, TM .
IEEE TRANSACTIONS ON ELECTRONIC COMPUTERS, 1965, EC14 (03) :326-&
[9]  
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274