STRONG UNIVERSAL CONSISTENCY OF NEURAL-NETWORK CLASSIFIERS

Cited by: 50

Authors:

FARAGO, A ^{[1]}

LUGOSI, G ^{[1]}

Affiliations:

[1] TECH UNIV BUDAPEST,FAC ELECT ENGN,DEPT MATH,H-1521 BUDAPEST,HUNGARY

Source:

IEEE TRANSACTIONS ON INFORMATION THEORY | 1993 / Vol. 39 / No. 04

Keywords:

PATTERN RECOGNITION; NEURAL NETWORKS; NONPARAMETRIC CLASSIFICATION; CONSISTENCY; TRAINING ALGORITHMS;

DOI:

10.1109/18.243433

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Subject classification code:

0812 ;

Abstract:

In statistical pattern recognition, a classifier is called universally consistent if its error probability converges to the Bayes risk as the size of the training data grows, for all possible distributions of the random variable pair of the observation vector and its class. It is proven that if a one-layered neural network with a properly chosen number of nodes is trained to minimize the empirical risk on the training data, then it results in a universally consistent classifier. It is shown that the exponent in the rate of convergence does not depend on the dimension if certain smoothness conditions on the distribution are satisfied. That is, this class of universally consistent classifiers does not suffer from the "curse of dimensionality." A training algorithm is also presented that finds the optimal set of parameters in polynomial time if the number of nodes and the space dimension are fixed and the amount of training data grows.
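
The definitions used in the abstract can be written out as follows. This is a sketch in standard notation, not taken from this record: the symbols $(X, Y)$, $D_n$, $g_n$, $L(g_n)$, $L^*$ and the classifier class $\mathcal{G}_k$ (networks with $k$ nodes) are assumptions introduced here for illustration, and the paper's own notation may differ.

```latex
% Assumed notation: (X, Y) is the observation/class pair,
% D_n = ((X_1, Y_1), ..., (X_n, Y_n)) is the training data,
% g_n is the classifier built from D_n.
\[
  L(g_n) = \mathbf{P}\{ g_n(X) \neq Y \mid D_n \},
  \qquad
  L^* = \inf_{g} \mathbf{P}\{ g(X) \neq Y \} \quad \text{(Bayes risk)}.
\]
% Universal consistency: convergence holds for every distribution of (X, Y).
\[
  \lim_{n \to \infty} \mathbf{E}\, L(g_n) = L^*
  \quad \text{for all distributions of } (X, Y).
\]
% Strong universal consistency (as in the title): convergence with probability one.
\[
  \lim_{n \to \infty} L(g_n) = L^* \quad \text{with probability one}.
\]
% Empirical risk minimization over an assumed class G_k of k-node networks.
\[
  g_n = \operatorname*{arg\,min}_{g \in \mathcal{G}_k}
        \frac{1}{n} \sum_{i=1}^{n} \mathbf{1}\{ g(X_i) \neq Y_i \}.
\]
```

In this reading, the "properly chosen number of nodes" corresponds to letting $k$ grow with $n$ at a suitable rate; the exact condition is stated in the paper, not in this record.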

Pages: 1146-1151

Number of pages: 6
