Arterial travel time forecast with streaming data: A hybrid approach of flow modeling and machine learning

Cited by: 175
Authors
Hofleitner, Aude [1]
Herring, Ryan [2]
Bayen, Alexandre [1,3]
Affiliations
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Ind Engn & Operat Res, Apple Inc, Cupertino, CA USA
[3] Univ Calif Berkeley, Dept Civil & Environm Engn, Berkeley, CA 94720 USA
Keywords
Arterial traffic; Estimation; Forecast; Streaming data; Machine learning; GPS probe data; SPACE NEURAL-NETWORKS; TRAFFIC FLOW; WAVES
DOI
10.1016/j.trb.2012.03.006
Chinese Library Classification
F [Economics]
Discipline classification code
020101 [Political Economy]
Abstract
This article presents a hybrid modeling framework for estimating and predicting arterial traffic conditions using streaming GPS probe data. The model is based on a well-established theory of traffic flow through signalized intersections and is combined with a machine learning framework both to learn static parameters of the roadways (such as free flow velocity or traffic signal parameters) and to estimate and predict travel times through the arterial network. The machine learning component of the approach uses the significant amount of historical data collected by the Mobile Millennium system since March 2009, with over 500 probe vehicles reporting their position once per minute in San Francisco, CA. The hybrid model provides a distinct advantage over pure statistical or pure traffic theory models in that it is robust to noisy data (due to the large volumes of historical data) and it produces forecasts using traffic flow theory principles consistent with the physics of traffic. Validation of the model is performed in two different ways. First, a large-scale test of the model is performed by splitting the data source into two sets, using the first to produce the estimates and the second to validate them. Second, an alternate validation approach is presented. It consists of a 3-day experiment in which GPS data was collected once per second from 20 drivers on four routes through San Francisco, allowing for precise calculation of actual travel times. The model is run by down-sampling the data and validated using the travel times from these 20 drivers. The results indicate that this approach is a significant step forward in estimating traffic states throughout the arterial network using a relatively small amount of real-time data. The estimates from our model are compared to those given by a data-driven baseline algorithm, for which we achieve a 16% improvement in terms of the root mean squared error of travel time estimates. The primary reason for success is the reliance on a flow model of traffic, which ensures that estimates are consistent with the physics of traffic. © 2012 Elsevier Ltd. All rights reserved.
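The validation arithmetic described in the abstract (down-sampling a once-per-second GPS trace to a sparser reporting rate, then comparing root mean squared errors against a baseline) can be illustrated with a minimal sketch. This is not the authors' code: the function names `downsample`, `rmse`, and `relative_improvement` are hypothetical, the trace format of (timestamp in seconds, position) pairs is an assumption, and the abstract does not state exactly how the 16% figure was computed, so the relative-reduction formula below is one plausible reading.

```python
import math


def downsample(trace, period_s):
    """Keep at most one (timestamp_s, position) sample per `period_s` seconds.

    `trace` is assumed to be sorted by timestamp (hypothetical format).
    """
    kept = []
    next_t = None
    for t, pos in trace:
        if next_t is None or t >= next_t:
            kept.append((t, pos))
            next_t = t + period_s
    return kept


def rmse(estimates, truths):
    """Root mean squared error between estimated and measured travel times."""
    if not estimates or len(estimates) != len(truths):
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(e - a) ** 2 for e, a in zip(estimates, truths)]
    return math.sqrt(sum(squared) / len(squared))


def relative_improvement(rmse_model, rmse_baseline):
    """Fractional RMSE reduction of the model relative to the baseline."""
    return (rmse_baseline - rmse_model) / rmse_baseline
```

Under this reading, a model RMSE of 84 s against a baseline RMSE of 100 s (illustrative numbers, not results from the paper) would correspond to a relative improvement of 0.16.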
Pages: 1097-1122
Number of pages: 26