CONSTRUCTIVE LEARNING OF RECURRENT NEURAL NETWORKS - LIMITATIONS OF RECURRENT CASADE CORRELATION AND A SIMPLE SOLUTION

被引:47
作者
GILES, CL
CHEN, D
SUN, GZ
CHEN, HH
LEE, YC
GOUDREAU, MW
机构
[1] NEC RES INST, PRINCETON, NJ 08540 USA
[2] UNIV CENT FLORIDA, DEPT COMP SCI, ORLANDO, FL 32816 USA
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 1995年 / 6卷 / 04期
关键词
D O I
10.1109/72.392247
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is often difficult to predict the optimal neural network size for a particular application, Constructive or destructive methods that add or subtract neurons, layers, connections, etc, might offer a solution to this problem, We prove that one method, recurrent cascade correlation, due to its topology, has fundamental limitations in representation and thus in its learning capabilities, It cannot represent with monotone (i.e., sigmoid) and hard-threshold activation functions certain finite state automata, We give a ''preliminary'' approach on how to get ground these limitations by devising a simple constructive training method that adds neurons during training while still preserving the powerful fully-recurrent structure, We illustrate this approach by simulations which learn many examples of regular grammars that the recurrent cascade correlation method is unable to learn.
引用
收藏
页码:829 / 836
页数:8
相关论文
共 43 条
[1]   EFFICIENT SIMULATION OF FINITE AUTOMATA BY NEURAL NETS [J].
ALON, N ;
DEWDNEY, AK ;
OTT, TJ .
JOURNAL OF THE ACM, 1991, 38 (02) :495-514
[2]  
[Anonymous], 1990, ADV NEURAL INF PROCE
[3]  
[Anonymous], 1991, INTRO THEORY NEURAL, DOI DOI 10.1201/9780429499661
[4]  
Ash T., 1989, Connection Science, V1, P365, DOI 10.1080/09540098908915647
[5]   Finite State Automata and Simple Recurrent Networks [J].
Cleeremans, Axel ;
Servan-Schreiber, David ;
McClelland, James L. .
NEURAL COMPUTATION, 1989, 1 (03) :372-381
[6]  
DIEDERICH J, 1988, 8TH P EUR C ART INT
[7]   FINDING STRUCTURE IN TIME [J].
ELMAN, JL .
COGNITIVE SCIENCE, 1990, 14 (02) :179-211
[8]  
FAHLMAN S, 1990, CMUCS90100 CARN U SC
[9]  
Fahlman SE, 1991, ADV NEURAL INFORMATI, P190
[10]  
Frean M, 1990, NEURAL COMPUT, V2, P198