Reinforcement learning for True Adaptive traffic signal control

被引:357
作者
Abdulhai, B
Pringle, R
Karakoulas, GJ
机构
[1] Univ Toronto, Dept Civil Engn, Intelligent Transportat Syst Ctr, Toronto, ON M5S 1A4, Canada
[2] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 1A4, Canada
关键词
traffic signal controllers; intelligent transportation systems; traffic control; traffic management; adaptive systems;
D O I
10.1061/(ASCE)0733-947X(2003)129:3(278)
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The ability to exert real-time, adaptive control of transportation processes is the core of many intelligent transportation systems decision support tools. Reinforcement learning, an artificial intelligence approach undergoing development in the machine-learning community, offers key advantages in this regard. The ability of a control agent to learn relationships between control actions and their effect on the environment while pursuing a goal is a distinct improvement over prespecified models of the environment. Prespecified models are a prerequisite of conventional control methods and their accuracy limits the performance of control agents. This paper contains an introduction to Q-learning, a simple yet powerful reinforcement learning algorithm, and presents a case study involving application to traffic signal control. Encouraging results of the application to an isolated traffic signal, particularly under variable traffic conditions, are presented. A broader research effort is outlined, including extension to linear and networked signal systems and integration with dynamic route guidance. The research objective involves optimal control of heavily congested traffic across a two-dimensional road network-a challenging task for conventional traffic signal control methodologies.
引用
收藏
页码:278 / 285
页数:8
相关论文
共 20 条
[1]  
Abdulhai B, 1999, TRANSPORT RES C-EMER, V7, P262
[2]  
ABDULHAI B, 1999, P 6 WORLD C INT TRAN
[3]  
Albus J. S., 1975, Transactions of the ASME. Series G, Journal of Dynamic Systems, Measurement and Control, V97, P228, DOI 10.1115/1.3426923
[4]  
[Anonymous], J DYN SYST MEAS CONT, DOI DOI 10.1115/1.3426922
[5]  
[Anonymous], 1996, TRANSP RES REC
[6]  
BERTHERTON D, 1996, TRANSPORT RES REC, V1554, P48
[7]  
BERTHERTON D, 1998, TRANSPORT RES REC, V1634, P118
[8]  
Bertsekas D. P., 1996, Neuro Dynamic Programming, V1st
[9]  
BINGHAM E, 1998, THESIS HELSINKI U TE
[10]  
HUNT PB, 1981, 1014 TRANSP ROAD RES