Urban traffic signal control using reinforcement learning agents

被引:133
作者
Balaji, P. G. [1 ]
German, X. [1 ]
Srinivasan, D. [1 ]
机构
[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117576, Singapore
关键词
NEURAL-NETWORKS; SYSTEM;
D O I
10.1049/iet-its.2009.0096
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This study presents a distributed multi-agent-based traffic signal control for optimising green timing in an urban arterial road network to reduce the total travel time and delay experienced by vehicles. The proposed multi-agent architecture uses traffic data collected by sensors at each intersection, stored historical traffic patterns and data communicated from agents in adjacent intersections to compute green time for a phase. The parameters like weights, threshold values used in computing the green time is fine tuned by online reinforcement learning with an objective to reduce overall delay. PARAMICS software was used as a platform to simulate 29 signalised intersection at Central Business District of Singapore and test the performance of proposed multi-agent traffic signal control for different traffic scenarios. The proposed multi-agent reinforcement learning (RLA) signal control showed significant improvement in mean time delay and speed in comparison to other traffic control system like hierarchical multi-agent system (HMS), cooperative ensemble (CE) and actuated control.
引用
收藏
页码:177 / 188
页数:12
相关论文
共 23 条
[11]   THE GLIDE SYSTEM - SINGAPORE URBAN TRAFFIC CONTROL-SYSTEM [J].
KEONG, CK .
TRANSPORT REVIEWS, 1993, 13 (04) :295-305
[12]  
Koonce P., 2008, FHWAHOP08024
[13]   A PROOF FOR THE QUEUING FORMULA - L=LAMBDA-W [J].
LITTLE, JDC .
OPERATIONS RESEARCH, 1961, 9 (03) :383-387
[14]  
Lowrie P.R., 1982, Proc. IEE Conference on Road Traffic Signaling, V207, P67
[15]  
MIZUNO K, 2007, 2 INT C KNOWL SCI EN, P73
[16]  
PECK C, 1990, IEE CONF PUBL, V320, P104
[17]   Using intelligent agents for pro-active, real-time urban intersection control [J].
Roozemond, DA .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2001, 131 (02) :293-301
[18]   Genetic algorithms and cellular automata:: A new architecture for traffic light cycles optimization [J].
Sánchez, JJ ;
Galán, M ;
Rubio, E .
CEC2004: PROCEEDINGS OF THE 2004 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2004, :1668-1674
[19]   THE SYDNEY COORDINATED ADAPTIVE TRAFFIC (SCAT) SYSTEM PHILOSOPHY AND BENEFITS [J].
SIMS, AG ;
DOBINSON, KW .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 1980, 29 (02) :130-137
[20]   Neural networks for real-time traffic signal control [J].
Srinivasan, Dipti ;
Choy, Min Chee ;
Cheu, Ruey Long .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2006, 7 (03) :261-272