Short-term traffic flow forecasting: parametric and nonparametric approaches via emotional temporal difference learning

被引:21
作者
Abdi, Javad [1 ]
Moshiri, Behzad [2 ]
Abdulhai, Baher [3 ]
Sedigh, Ali Khaki [4 ]
机构
[1] Islamic Azad Univ, Dept Elect & Comp Engn, Sci & Res Branch, Tehran, Iran
[2] Univ Tehran, Control & Intelligent Proc Ctr Excellence, Sch Elect & Comp Engn, Tehran, Iran
[3] Univ Toronto, Dept Civil Engn, Toronto Intelligent Transportat Syst Ctr & Testbe, Toronto, ON, Canada
[4] KN Toosi Univ Technol, Dept Elect Engn, Tehran, Iran
关键词
Multi-agent systems; Multi-step forecasting; Reinforcement learning; Emotional learning; Q-learning temporal difference learning; Autoregressive integrated moving average; Neural network; Multi-layer perceptron neural networks; Short-term traffic flow; ARTIFICIAL NEURAL-NETWORKS; TIME-SERIES; HYBRID ARIMA; SAMPLE-SIZE; MODEL; PREDICTION; IDENTIFICATION; PERFORMANCE; SIMULATION; VOLUME;
D O I
10.1007/s00521-012-0977-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Information signal from real case and natural complex dynamical systems such as traffic flow are usually specified by irregular motions. Chaotic nonlinear dynamics approach is now the most powerful tool for scientists to deal with complexities in real cases, and neural networks and neuro-fuzzy models are widely used for their capabilities in nonlinear modeling of chaotic systems more than the traditional methods. As mentioned, the traffic flow conditions caused the forecasting values of traffic flow to lack robustness and accuracy. In this paper, the traffic flow forecasting is analyzed with emotional concepts and multi-agent systems (MASs) points of view as a new method in this field. The findings enabled the researchers to develop a newly object-oriented method of forecasting traffic flow. Its architecture is based on a temporal difference (TD) Q-learning with a neuro-fuzzy structure, which is the nonparametric approach. The performance of TD Q-learning is improved by emotional learning. The proposed method on the present conditions and the action of the system according to the criteria could forecast traffic signals so that the objectives are reached in minimum time. The ability of presented learning algorithm to prospect gains from future actions and obtain rewards from its past experiences allows emotional TD Q-learning algorithm to improve its decisions for the best possible actions. In addition, to study in a more practical situation, the neuro-fuzzy behaviors could be modeled by MAS. The proposed method (intelligent/nonparametric approach) is compared by parametric approach, autoregressive integrated moving average (ARIMA) method, which is implemented by multi-layer perceptron neural networks and called ARIMANN. Here, the ARIMANN is updated by backpropagation and temporal difference backpropagation for the first time. The simulation results revealed that the studied forecaster could discover the optimal forecasting by means of the Q-learning algorithm. Difficult to handle through parametric and classic methods, the real traffic flow signals used for fitting the algorithms is obtained from a two-lane street I-494 in Minnesota City.
引用
收藏
页码:141 / 159
页数:19
相关论文
共 77 条
[1]  
Abdi J., 2004, INT J ENG NATL CTR S, V17, P363
[2]  
Abdi J., 2005, J SCI TECHNOL SHARIF, V30, P13
[3]   Short-term traffic flow prediction using neuro-genetic algorithms [J].
Abdulhai, B ;
Porwal, H ;
Recker, W .
ITS JOURNAL, 2002, 7 (01) :3-41
[4]  
Ahmed M. S., 1979, Analysis of freeway traffic timeseries data by using Box-Jenkins techniques
[5]   Artificial neural networks as applied to long-term demand forecasting [J].
Al-Saba, T ;
El-Amin, I .
ARTIFICIAL INTELLIGENCE IN ENGINEERING, 1999, 13 (02) :189-197
[6]  
[Anonymous], 2004, EQ SMOOTH MOD
[7]  
[Anonymous], AM J MATH MANAGEMENT
[8]  
[Anonymous], 1990, Proceedings of the International Joint Conference on Neural Networks
[9]  
[Anonymous], 1997, MACHINE LEARNING, MCGRAW-HILL SCIENCE/ENGINEERING/MATH
[10]  
Balkenius C, 2000, ANIMALS ANIMATS