Reinforcement learning for True Adaptive traffic signal control

被引：357

作者：

Abdulhai, B

Pringle, R

Karakoulas, GJ

机构：

[1] Univ Toronto, Dept Civil Engn, Intelligent Transportat Syst Ctr, Toronto, ON M5S 1A4, Canada

[2] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 1A4, Canada

来源：

JOURNAL OF TRANSPORTATION ENGINEERING | 2003年 / 129卷 / 03期

关键词：

traffic signal controllers; intelligent transportation systems; traffic control; traffic management; adaptive systems;

D O I：

10.1061/(ASCE)0733-947X(2003)129:3(278)

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

The ability to exert real-time, adaptive control of transportation processes is the core of many intelligent transportation systems decision support tools. Reinforcement learning, an artificial intelligence approach undergoing development in the machine-learning community, offers key advantages in this regard. The ability of a control agent to learn relationships between control actions and their effect on the environment while pursuing a goal is a distinct improvement over prespecified models of the environment. Prespecified models are a prerequisite of conventional control methods and their accuracy limits the performance of control agents. This paper contains an introduction to Q-learning, a simple yet powerful reinforcement learning algorithm, and presents a case study involving application to traffic signal control. Encouraging results of the application to an isolated traffic signal, particularly under variable traffic conditions, are presented. A broader research effort is outlined, including extension to linear and networked signal systems and integration with dynamic route guidance. The research objective involves optimal control of heavily congested traffic across a two-dimensional road network-a challenging task for conventional traffic signal control methodologies.

引用

页码：278 / 285

页数：8

共 20 条

[1]

Abdulhai B, 1999, TRANSPORT RES C-EMER, V7, P262

[2]

ABDULHAI B, 1999, P 6 WORLD C INT TRAN

[3]

Albus J. S., 1975, Transactions of the ASME. Series G, Journal of Dynamic Systems, Measurement and Control, V97, P228, DOI 10.1115/1.3426923

[4]

[Anonymous], J DYN SYST MEAS CONT, DOI DOI 10.1115/1.3426922

[5]

[Anonymous], 1996, TRANSP RES REC

[6]

BERTHERTON D, 1996, TRANSPORT RES REC, V1554, P48

[7]

BERTHERTON D, 1998, TRANSPORT RES REC, V1634, P118

[8]

Bertsekas D. P., 1996, Neuro Dynamic Programming, V1st

[9]

BINGHAM E, 1998, THESIS HELSINKI U TE

[10]

HUNT PB, 1981, 1014 TRANSP ROAD RES

← 1 2 →