Optimal learning in artificial neural networks: A review of theoretical results

被引:34
作者
Bianchini, M [1 ]
Gori, M [1 ]
机构
[1] UNIV FLORENCE,DIPARTIMENTO SISTEMI & INFORMAT,I-50139 FLORENCE,ITALY
关键词
learning algorithms; optimal learning; connectionist models;
D O I
10.1016/0925-2312(95)00032-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The effectiveness of connectionist models in emulating intelligent behaviour and solving significant practical problems is strictly related to the capability of the learning algorithms to find optimal or near-optimal solutions and to generalize to new examples, This paper reviews some theoretical contributions to optimal learning in the attempt to provide a unified view and give the state of the art in the field. The focus of the review is on the problem of local minima in the cost function that is likely to affect more or less any learning algorithm. Starting from this analysis, we briefly review proposals for discovering optimal solutions and suggest conditions for designing architectures tailored to a given task.
引用
收藏
页码:313 / 346
页数:34
相关论文
共 84 条
[1]  
ACKLEY DH, 1985, COGNITIVE SCI, V9, P147
[2]  
Anderson J. A., 1988, Neurocomputing: Foundations of research
[3]  
[Anonymous], 1969, APPL OPTIMAL CONTROL
[4]  
[Anonymous], 1987, COMPUT SPEECH LANG
[5]   NEURAL NETWORKS AND PRINCIPAL COMPONENT ANALYSIS - LEARNING FROM EXAMPLES WITHOUT LOCAL MINIMA [J].
BALDI, P ;
HORNIK, K .
NEURAL NETWORKS, 1989, 2 (01) :53-58
[6]  
BAUM EB, 1988, NEURAL INFORMATION P, P52
[7]   LEARNING THE DYNAMIC NATURE OF SPEECH WITH BACKPROPAGATION FOR SEQUENCES [J].
BENGIO, Y ;
DEMORI, R ;
GORI, M .
PATTERN RECOGNITION LETTERS, 1992, 13 (05) :375-385
[8]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[9]  
BENGIO Y, 1990, SPEECH COMMUN, V9, P15
[10]   ON THE PROBLEM OF LOCAL MINIMA IN RECURRENT NEURAL NETWORKS [J].
BIANCHINI, M ;
GORI, M ;
MAGGINI, M .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :167-172