Reinforcement learning-based multi-agent system for network traffic signal control

Cited by: 385

Authors:

Arel, I. [1]

Liu, C. [1]

Urbanik, T. [2]

Kohls, A. G. [2]

Affiliations:

[1] Univ Tennessee, Dept Elect Engn & Comp Sci, Knoxville, TN 37996 USA

[2] Univ Tennessee, Dept Civil & Environm Engn, Knoxville, TN 37996 USA

Source:

IET INTELLIGENT TRANSPORT SYSTEMS | 2010 / Vol. 4 / Issue 2

Keywords:

DOI:

10.1049/iet-its.2009.0070

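Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809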

Abstract:

A challenging application of artificial intelligence systems involves the scheduling of traffic signals in multi-intersection vehicular networks. This paper introduces a novel use of a multi-agent system and reinforcement learning (RL) framework to obtain an efficient traffic signal control policy. The latter is aimed at minimising the average delay, congestion and likelihood of intersection cross-blocking. A five-intersection traffic network has been studied in which each intersection is governed by an autonomous intelligent agent. Two types of agents, a central agent and an outbound agent, were employed. The outbound agents schedule traffic signals by following the longest-queue-first (LQF) algorithm, which has been proved to guarantee stability and fairness, and collaborate with the central agent by providing it local traffic statistics. The central agent learns a value function driven by its local and neighbours' traffic conditions. The novel methodology proposed here utilises the Q-Learning algorithm with a feedforward neural network for value function approximation. Experimental results clearly demonstrate the advantages of multi-agent RL-based control over LQF governed isolated single-intersection control, thus paving the way for efficient distributed traffic signal control in complex settings.
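The two control rules named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the paper approximates the value function with a feedforward neural network, whereas this sketch swaps in a plain lookup table to stay short. The function names (`lqf_phase`, `q_update`), the phase labels, the state encoding, the reward, and the values of `alpha` and `gamma` are all assumptions made for illustration.

```python
from collections import defaultdict


def lqf_phase(queues):
    """Longest-queue-first: pick the signal phase with the longest queue.

    `queues` maps a phase label to its current queue length. Ties go to the
    phase whose label sorts first (an arbitrary choice made for this sketch).
    """
    return max(sorted(queues), key=lambda phase: queues[phase])


def q_update(q, state, action, reward, next_state, actions,
             alpha=0.1, gamma=0.9):
    """One standard Q-learning step on a tabular value function.

    Tabular storage stands in for the neural-network approximator described
    in the abstract; alpha and gamma are hypothetical values.
    """
    best_next = max(q[(next_state, a)] for a in actions)
    td_target = reward + gamma * best_next
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Hypothetical two-phase intersection: north-south and east-west.
phases = ("NS", "EW")
q_table = defaultdict(float)

# An outbound agent would simply serve the longest queue.
chosen = lqf_phase({"NS": 4, "EW": 7})

# A central agent would instead learn from observed delay (negative reward).
new_value = q_update(q_table, state=(4, 7), action="EW", reward=-5.0,
                     next_state=(5, 3), actions=phases)
```

In this sketch the outbound rule needs no learning at all, while the central rule improves its estimates only as rewards are observed, which mirrors the division of roles the abstract describes.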


Pages: 128-135

Page count: 8

