Adaptive traffic signal control using approximate dynamic programming

Cited by: 162
Authors
Cai, Chen [1]
Wong, Chi Kwong [1]
Heydecker, Benjamin G. [1]
Affiliations
[1] UCL, Ctr Transport Studies, London WC1E 6BT, England
Keywords
Traffic signal; Dynamic programming; Approximation; Adaptive; Reinforcement learning
DOI
10.1016/j.trc.2009.04.005
Chinese Library Classification (CLC) number
U [Transportation]
Subject classification codes
08; 0823
Abstract
This paper presents a study on an adaptive traffic signal controller for real-time operation. The controller aims for three operational objectives: dynamic allocation of green time, automatic adjustment to control parameters, and fast revision of signal plans. The control algorithm is built on approximate dynamic programming (ADP). This approach substantially reduces computational burden by using an approximation to the value function of the dynamic programming and reinforcement learning to update the approximation. We investigate temporal-difference learning and perturbation learning as specific learning techniques for the ADP approach. We find in computer simulation that the ADP controllers achieve substantial reduction in vehicle delays in comparison with optimised fixed-time plans. Our results show that substantial benefits can be gained by increasing the frequency at which the signal plans are revised, which can be achieved conveniently using the ADP approach. (C) 2009 Elsevier Ltd. All rights reserved.
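The abstract describes approximating the dynamic-programming value function and updating that approximation by temporal-difference learning. The following is a minimal sketch of that general idea only, not the authors' controller: it assumes a hypothetical two-approach junction, Bernoulli arrivals, a value function that is linear in the queue lengths, and a cost equal to the number of queued vehicles per step. All function names, parameters, and numeric values are illustrative assumptions.

```python
import random

def value(weights, queues):
    """Linear approximation of the cost-to-go: weighted sum of queue lengths."""
    return sum(w * q for w, q in zip(weights, queues))

def step(queues, green, arrivals, saturation=1):
    """Serve up to `saturation` vehicles on the green approach, then add arrivals."""
    return [q - (min(q, saturation) if i == green else 0) + arrivals[i]
            for i, q in enumerate(queues)]

def choose_green(weights, queues, rates, gamma):
    """Pick the approach whose green minimises the expected one-step cost plus
    the discounted approximate value of the expected next state."""
    def score(g):
        nxt = step(queues, g, rates)  # mean arrival rates stand in for random arrivals
        return sum(nxt) + gamma * value(weights, nxt)
    return min(range(len(queues)), key=score)

def td_update(weights, queues, cost, next_queues, alpha, gamma):
    """Semi-gradient TD(0): move the weights along the features (queue lengths)
    in proportion to the temporal-difference error."""
    delta = cost + gamma * value(weights, next_queues) - value(weights, queues)
    return [w + alpha * delta * q for w, q in zip(weights, queues)]

def simulate(steps=5000, rates=(0.2, 0.3), alpha=0.001, gamma=0.9, seed=0):
    """Run the toy controller; return learned weights and mean queued vehicles per step."""
    rng = random.Random(seed)
    weights = [0.0] * len(rates)
    queues = [0] * len(rates)
    total = 0
    for _ in range(steps):
        green = choose_green(weights, queues, rates, gamma)
        arrivals = [1 if rng.random() < r else 0 for r in rates]
        nxt = step(queues, green, arrivals)
        cost = sum(nxt)  # assumed cost: vehicles queued after this step
        weights = td_update(weights, queues, cost, nxt, alpha, gamma)
        queues = nxt
        total += cost
    return weights, total / steps
```

Calling `simulate()` returns the learned weights together with the average number of queued vehicles per step. The signal decision is re-evaluated at every step, which loosely mirrors the abstract's point about revising signal plans frequently; perturbation learning, the other technique the abstract names, is not shown here.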
Pages: 456-474
Page count: 19