TRAINING NEURAL NETWORKS WITH THE GRG2 NONLINEAR OPTIMIZER

被引:23
作者
HUNG, MS
DENTON, JW
机构
[1] Graduate School of Management, Kent State University, Kent
关键词
ARTIFICIAL NEURAL NETWORKS; NONLINEAR PROGRAMMING; ADAPTIVE PROCESSES; LEARNING;
D O I
10.1016/0377-2217(93)90093-3
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Neural networks represent a new approach to artificial intelligence. By using biologically motivated intensively interconnected networks of simple processing elements, certain pattern recognition tasks can be accomplished much faster than with currently used techniques. The most popular means of training these networks is back propagation, a gradient descent technique. The introduction of back propagation revolutionized research in neural networks, but the method has serious drawbacks in training speed and scalability to large problems. This paper compares the use of a general-purpose nonlinear optimizer, GRG2, with back propagation in training neural networks. Parity problems of increasing size are used to evaluate the scalability of each method to larger problems. It was found that GRG2 not only found solutions much faster, but also found much better solutions. The use of nonlinear programming methods in training therefore has the potential to allow neural networks to be applied to problems that have previously been beyond their capabilities.
引用
收藏
页码:83 / 91
页数:9
相关论文
共 24 条
[1]  
Abadie J, 1969, GEN WOLFE REDUCED GR
[2]   OPTIMIZATION STRATEGIES GLEANED FROM BIOLOGICAL EVOLUTION [J].
BRADY, RM .
NATURE, 1985, 317 (6040) :804-806
[3]   ART-2 - SELF-ORGANIZATION OF STABLE CATEGORY RECOGNITION CODES FOR ANALOG INPUT PATTERNS [J].
CARPENTER, GA ;
GROSSBERG, S .
APPLIED OPTICS, 1987, 26 (23) :4919-4930
[4]  
CATER JP, 1987, IEEE 1 INT C NEUR NE, V2, P645
[5]  
DAHL ED, 1987, P INT C NEURAL NETWO, V2, P523
[6]  
*DEF ADV RES PROS, 1988, DARPA NEUR NETW STUD
[7]   TABU SEARCH TECHNIQUES - A TUTORIAL AND AN APPLICATION TO NEURAL NETWORKS [J].
DEWERRA, D ;
HERTZ, A .
OR SPEKTRUM, 1989, 11 (03) :131-141
[8]  
Fletcher R., 1987, PRACTICAL METHODS OP, DOI 10.1002/9781118723203
[9]  
Glover F., 1990, ORSA Journal on Computing, V2, P4, DOI [10.1287/ijoc.1.3.190, 10.1287/ijoc.2.1.4]
[10]  
HUSH DR, 1988, P INT C NEURAL NETWO, V1, P441