Investigating the prediction capabilities of the simple recurrent neural network on real temporal sequences

被引:15
作者
Gupta, L [1 ]
McAvoy, M
机构
[1] So Illinois Univ, Coll Engn, Dept Elect Engn, Carbondale, IL 62901 USA
[2] Washington Univ, Neuroimaging Lab, St Louis, MO 63110 USA
关键词
recurrent neural network; prediction; initial context vector; network retraining;
D O I
10.1016/S0031-3203(99)00187-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of this paper is to evaluate the prediction capabilities of the simple recurrent neural network (SRNN). The main focus is on the prediction of non-orthogonal vector components of real temporal sequences. A prediction problem is formulated in which the input is a component of a real sequence and the output is a prediction of the next component of the sequence. A method is developed to train a single SRNN to predict the components of sequences belonging to multiple classes. The selection of a distinguishing initial context vector for each class is proposed to improve the prediction performance of the SRNN. A systematic method to re-train the SRNN with noisy exemplars is developed to improve the prediction generalization of the network. Through the methods developed in the paper, it is demonstrated that: (a) a single SRNN can be trained to predict, contextually, the components of real temporal sequences belonging to different classes, (b) the prediction error of the SRNN can be decreased by using a distinguishing initial context vector for each class, and (c) the prediction generalization of the SRNN can be increased significantly by re-training the network with noisy exemplars. (C) 2000 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:2075 / 2081
页数:7
相关论文
共 15 条
[1]  
[Anonymous], NEURAL COMPUTERS NEU
[2]  
BLAKE R, 1996, THESIS SO ILLINOIS U
[3]   Finite State Automata and Simple Recurrent Networks [J].
Cleeremans, Axel ;
Servan-Schreiber, David ;
McClelland, James L. .
NEURAL COMPUTATION, 1989, 1 (03) :372-381
[4]   FINDING STRUCTURE IN TIME [J].
ELMAN, JL .
COGNITIVE SCIENCE, 1990, 14 (02) :179-211
[5]  
ELMAN JL, 1991, MACH LEARNING, V7, P19
[6]  
GHOSH J, 1992, SPIE, V1710, P449
[7]   ROBUST AUTOMATIC TARGET RECOGNITION USING A LOCALIZED BOUNDARY REPRESENTATION [J].
GUPTA, L ;
SORTRAKUL, T ;
CHARLES, A ;
KISATSKY, P .
PATTERN RECOGNITION, 1995, 28 (10) :1587-1598
[8]   A NEURAL NETWORK APPROACH TO ROBUST SHAPE CLASSIFICATION [J].
GUPTA, L ;
SAYEH, MR ;
TAMMANA, R .
PATTERN RECOGNITION, 1990, 23 (06) :563-568
[9]  
JORDAN MI, 1988, 8827 MIT COINS
[10]   Predictive modular neural networks for time series classification [J].
Kehagias, A ;
Petridis, V .
NEURAL NETWORKS, 1997, 10 (01) :31-49