Reinforcement learning-based multi-agent system for network traffic signal control

被引:385
作者
Arel, I. [1 ]
Liu, C. [1 ]
Urbanik, T. [2 ]
Kohls, A. G. [2 ]
机构
[1] Univ Tennessee, Dept Elect Engn & Comp Sci, Knoxville, TN 37996 USA
[2] Univ Tennessee, Dept Civil & Environm Engn, Knoxville, TN 37996 USA
关键词
D O I
10.1049/iet-its.2009.0070
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A challenging application of artificial intelligence systems involves the scheduling of traffic signals in multi-intersection vehicular networks. This paper introduces a novel use of a multi-agent system and reinforcement learning (RL) framework to obtain an efficient traffic signal control policy. The latter is aimed at minimising the average delay, congestion and likelihood of intersection cross-blocking. A five-intersection traffic network has been studied in which each intersection is governed by an autonomous intelligent agent. Two types of agents, a central agent and an outbound agent, were employed. The outbound agents schedule traffic signals by following the longest-queue-first (LQF) algorithm, which has been proved to guarantee stability and fairness, and collaborate with the central agent by providing it local traffic statistics. The central agent learns a value function driven by its local and neighbours' traffic conditions. The novel methodology proposed here utilises the Q-Learning algorithm with a feedforward neural network for value function approximation. Experimental results clearly demonstrate the advantages of multi-agent RL-based control over LQF governed isolated single-intersection control, thus paving the way for efficient distributed traffic signal control in complex settings.
引用
收藏
页码:128 / 135
页数:8
相关论文
共 24 条
[1]   Reinforcement learning for True Adaptive traffic signal control [J].
Abdulhai, B ;
Pringle, R ;
Karakoulas, GJ .
JOURNAL OF TRANSPORTATION ENGINEERING, 2003, 129 (03) :278-285
[2]  
ALBUS JS, 1971, MATH BIOSCI, V2, P25
[3]  
Broomhead D. S., 1988, Complex Systems, V2, P321
[4]   Adaptive traffic signal control using approximate dynamic programming [J].
Cai, Chen ;
Wong, Chi Kwong ;
Heydecker, Benjamin G. .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2009, 17 (05) :456-474
[5]   Multi-agent model predictive control of signaling split in urban traffic networks [J].
de Oliveira, Lucas Barcelos ;
Camponogara, Eduardo .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2010, 18 (01) :120-139
[6]   A multivariable regulator approach to traffic-responsive network-wide signal control [J].
Diakaki, C ;
Papageorgiou, M ;
Aboudolas, K .
CONTROL ENGINEERING PRACTICE, 2002, 10 (02) :183-195
[7]  
FEHON PK, 2004, ITE DISTR 6 ANN M DK
[8]  
Gershenson C, 2005, COMPLEX SYST, V16, P29
[9]  
HAYKIN S, 1998, NEURAL NETWORKS COMP, P670
[10]  
Jacob C, 2006, TRANSPORT RES REC, P1