A CONNECTIONIST LEARNING ALGORITHM WITH PROVABLE GENERALIZATION AND SCALING BOUNDS

被引:12
作者
GALLANT, SI
机构
关键词
BRD algorithm; Computational learning theory; Connectionist; Decision lists; Distributed method; k-order distributed networks; Neural network; PAC-Learning; Perceptron; Pocket algorithm; Valiant Model; Vapnik-Chervonenkis dimension;
D O I
10.1016/0893-6080(90)90089-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A connectionist learning algorithm, the bounded, randomized, distributed (BRD) algorithm, is presented and formally analyzed within the framework of computational learning theory. From a neural network viewpoint this framework gives clear definitions to commonly used terms such as "generalization" and "scaling up," and addresses the following questions: • • What class of functions is being learned? • • How many training examples should be used? • • How many iterations are required? • • With what certainty can we be assured of learning a good model? From a computational learning theory perspective, a new class of connectionist concepts is shown to be polynomially learnable using the BRD algorithm. Since a variant of the BRD algorithm is in current use for tasks such as pattern recognition, this makes it one of the few learning algorithms shown to be polynomial within the computational learning theory framework that is close to an "industrial strength" algorithm. The algorithm can fail for several reasons: (a) noisy inputs; (b) underestimation of the difficulty of the concept being learned (i.e., larger concept class required); or (c) bad luck. Whenever the algorithm fails, there are several "fallback bounds" available. Finally, the Appendix gives a learnable class of network functions that strictly enlarges a class of learnable functions, Rivest's k-decision lists. © 1990.
引用
收藏
页码:191 / 201
页数:11
相关论文
共 31 条
[1]   What Size Net Gives Valid Generalization? [J].
Baum, Eric B. ;
Haussler, David .
NEURAL COMPUTATION, 1989, 1 (01) :151-160
[2]  
BLUM A, 1988, COLT 88, P9
[3]   OCCAM RAZOR [J].
BLUMER, A ;
EHRENFEUCHT, A ;
HAUSSLER, D ;
WARMUTH, MK .
INFORMATION PROCESSING LETTERS, 1987, 24 (06) :377-380
[4]   LEARNABILITY AND THE VAPNIK-CHERVONENKIS DIMENSION [J].
BLUMER, A ;
EHRENFEUCHT, A ;
HAUSSLER, D ;
WARMUTH, MK .
JOURNAL OF THE ACM, 1989, 36 (04) :929-965
[5]   GEOMETRICAL AND STATISTICAL PROPERTIES OF SYSTEMS OF LINEAR INEQUALITIES WITH APPLICATIONS IN PATTERN RECOGNITION [J].
COVER, TM .
IEEE TRANSACTIONS ON ELECTRONIC COMPUTERS, 1965, EC14 (03) :326-&
[7]  
Gallant S. I., 1986, Eighth International Conference on Pattern Recognition. Proceedings (Cat. No.86CH2342-4), P849
[8]   CONNECTIONIST EXPERT SYSTEMS [J].
GALLANT, SI .
COMMUNICATIONS OF THE ACM, 1988, 31 (02) :152-169
[9]  
GALLANT SI, IN PRESS IEEE T NEUR
[10]  
GALLANT SI, 1987, IEEE INT C NEUR NETW, V2, P671