Recursive Bayesian Recurrent Neural Networks for Time-Series Modeling

被引:108
作者
Mirikitani, Derrick T. [1 ]
Nikolaev, Nikolay [1 ]
机构
[1] Univ London Goldsmiths Coll, Dept Comp, London SE14 6NW, England
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2010年 / 21卷 / 02期
关键词
Bayesian regularization; recurrent neural network (RNN); sequential Levenberg-Marquardt; LEARNING ALGORITHM; FEEDFORWARD; PREDICTION;
D O I
10.1109/TNN.2009.2036174
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
This paper develops a probabilistic approach to recursive second-order training of recurrent neural networks (RNNs) for improved time-series modeling. A general recursive Bayesian Levenberg-Marquardt algorithm is derived to sequentially update the weights and the covariance (Hessian) matrix. The main strengths of the approach are a principled handling of the regularization hyperparameters that leads to better generalization, and stable numerical performance. The framework involves the adaptation of a noise hyperparameter and local weight prior hyperparameters, which represent the noise in the data and the uncertainties in the model parameters. Experimental investigations using artificial and real-world data sets show that RNNs equipped with the proposed approach outperform standard real-time recurrent learning and extended Kalman training algorithms for recurrent networks, as well as other contemporary nonlinear neural models, on time-series modeling.
引用
收藏
页码:262 / 274
页数:13
相关论文
共 48 条
[1]
[Anonymous], 1994, NEURAL NETWORKS
[2]
[Anonymous], 1963, Soviet Math
[3]
[Anonymous], 2002, NETLAB: Algorithms for Pattern rRcognition
[4]
Bishop CM., 1995, NEURAL NETWORKS PATT
[5]
Campolucci P, 1998, IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, P384, DOI 10.1109/IJCNN.1998.682297
[6]
NONLINEAR PREDICTION OF CHAOTIC TIME-SERIES [J].
CASDAGLI, M .
PHYSICA D, 1989, 35 (03) :335-356
[7]
CHAN LW, 1999, P INT JOINT C NEUR N, V2, P1521
[8]
COELHO PHG, 2000, P IEEE INNS ENNS INT, V4, P563
[9]
NEURAL LEARNING OF CHAOTIC DYNAMICS [J].
DECO, G ;
SCHURMANN, B .
NEURAL PROCESSING LETTERS, 1995, 2 (02) :23-26
[10]
FINDING STRUCTURE IN TIME [J].
ELMAN, JL .
COGNITIVE SCIENCE, 1990, 14 (02) :179-211