Quality of location-based crowdsourced speed data on surface streets: A case study of Waze and Bluetooth speed data in Sevierville, TN

被引:35
作者
Hoseinzadeh, Nima [1 ]
Liu, Yuandong [1 ]
Han, Lee D. [1 ]
Brakewood, Candace [1 ]
Mohammadnazar, Amin [1 ]
机构
[1] Univ Tennessee, Dept Civil & Environm Engn, Knoxville, TN 37996 USA
关键词
Location-based data; Crowdsourced data; Waze; Bluetooth; Big data; Smart cities; Surface streets; RELIABILITY; FREEWAY; COVERAGE;
D O I
10.1016/j.compenvurbsys.2020.101518
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Obtaining accurate speed and travel time information is a challenge for researchers, geographers, and transportation agencies. In the past, traffic data were usually acquired and disseminated by government agencies through fixed-location sensors. High costs, infrastructure demands, and low coverage levels of these sensor devices require agencies and researchers to look beyond the traditional approaches. With the emergence of smartphones and navigation apps, location-based and crowdsourced Big Data are receiving increased attention. In this regard, location-based big data (LocBigData) collected from probe vehicles and road users can be used to provide speed and travel time information in different locations. Examining the quality of crowdsourced data is essential for researchers and agencies before using them. This study assessed the quality of Waze speed data from surface streets and conducted a case study in Sevierville, Tennessee. Typically, examining the quality of these data in surface streets and arterials is more challenging than freeways data. This research used Bluetooth speed data as the ground truth, which is independent of Waze data. In this study, three steps of methodology were used. In the first step, Waze speed data was compared to Bluetooth data in terms of accuracy, mean difference, and distribution similarity. In the second step, a k-means algorithm was used to categorize Waze data quality, and a multinomial logistics regression model was performed to explore the significant factors that impact data quality. Finally, in the third step, machine learning techniques were conducted to predict the data quality in different conditions. The result of the comparison showed a similar pattern and a slight difference between datasets, which verified the quality of Waze speed data. The statistical model indicates that that Waze speed data are more accurate in peak hours than in night hours. Also, the traffic speed, traffic volume, and segment length have a significant association on the accuracy of Waze data on surface streets. Finally, the result of machine learning prediction showed that a KNN method performed the highest prediction accuracy of 84.5% and 82.9% of the time for training and test datasets, respectively. Overall, the study results suggest that Waze speed data is a promising data source for surface streets.
引用
收藏
页数:14
相关论文
共 66 条
[1]   Framework for Evaluating the Reliability of Wide-Area Probe Data [J].
Adu-Gyamfi, Yaw Okyere ;
Sharma, Anuj ;
Knickerbocker, Skylar ;
Hawkins, Neal ;
Jackson, Michael .
TRANSPORTATION RESEARCH RECORD, 2017, 2643 (01) :93-104
[2]   Quantitative analysis of probe data characteristics: Coverage, speed bias and congestion detection precision [J].
Ahsani, Vesal ;
Amin-Naseri, Mostafa ;
Knickerbocker, Skylar ;
Sharma, Anuj .
JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 23 (02) :103-119
[3]  
Alhajri F., 2014, A Comparative Assessment of Crowded Source Travel Time Estimates: A Case Study of Bluetooth vs INRIX on a Suburban Arterial
[4]   Bluetooth Sensor Data and Ground Truth Testing of Reported Travel Times [J].
Aliari, Yashar ;
Haghani, Ali .
TRANSPORTATION RESEARCH RECORD, 2012, (2308) :167-172
[5]   Evaluating the Reliability, Coverage, and Added Value of Crowdsourced Traffic Incident Reports from Waze [J].
Amin-Naseri, Mostafa ;
Chakraborty, Pranamesh ;
Sharma, Anuj ;
Gilbert, Stephen B. ;
Hong, Mingyi .
TRANSPORTATION RESEARCH RECORD, 2018, 2672 (43) :34-43
[6]   Reliability of Bluetooth Technology for Travel Time Estimation [J].
Araghi, Bahar Namaki ;
Olesen, Jonas Hammershoj ;
Krishnan, Rajesh ;
Christensen, Lars Torholm ;
Lahrmann, Harry .
JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 19 (03) :240-255
[7]  
Azad M., 2019, TRANSP RES BOARD 98
[8]   Severity analysis for large truck rollover crashes using a random parameter ordered logit model [J].
Azimi, Ghazaleh ;
Rahimi, Alireza ;
Asgari, Hamidreza ;
Jin, Xia .
ACCIDENT ANALYSIS AND PREVENTION, 2020, 135
[9]   Fusing a Bluetooth Traffic Monitoring System With Loop Detector Data for Improved Freeway Traffic Speed Estimation [J].
Bachmann, Chris ;
Roorda, Matthew J. ;
Abdulhai, Baher ;
Moshiri, Behzad .
JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2013, 17 (02) :152-164
[10]  
Bahaweres RB, 2017, 2017 5TH INTERNATIONAL CONFERENCE ON CYBER AND IT SERVICE MANAGEMENT (CITSM 2017), P1