Quality of location-based crowdsourced speed data on surface streets: A case study of Waze and Bluetooth speed data in Sevierville, TN

Cited by: 35
Authors
Hoseinzadeh, Nima [1]
Liu, Yuandong [1]
Han, Lee D. [1]
Brakewood, Candace [1]
Mohammadnazar, Amin [1]
Affiliations
[1] Univ Tennessee, Dept Civil & Environm Engn, Knoxville, TN 37996 USA
Keywords
Location-based data; Crowdsourced data; Waze; Bluetooth; Big data; Smart cities; Surface streets; RELIABILITY; FREEWAY; COVERAGE;
DOI
10.1016/j.compenvurbsys.2020.101518
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Obtaining accurate speed and travel time information is a challenge for researchers, geographers, and transportation agencies. In the past, traffic data were usually acquired and disseminated by government agencies through fixed-location sensors. The high costs, infrastructure demands, and low coverage of these sensor devices require agencies and researchers to look beyond the traditional approaches. With the emergence of smartphones and navigation apps, location-based and crowdsourced Big Data are receiving increased attention. In this regard, location-based big data (LocBigData) collected from probe vehicles and road users can be used to provide speed and travel time information in different locations. Examining the quality of crowdsourced data is essential for researchers and agencies before using them. This study assessed the quality of Waze speed data from surface streets through a case study in Sevierville, Tennessee. Typically, examining the quality of these data on surface streets and arterials is more challenging than on freeways. This research used Bluetooth speed data, which are independent of Waze data, as the ground truth. A three-step methodology was used. In the first step, Waze speed data were compared to Bluetooth data in terms of accuracy, mean difference, and distribution similarity. In the second step, a k-means algorithm was used to categorize Waze data quality, and a multinomial logistic regression model was fitted to explore the significant factors that affect data quality. Finally, in the third step, machine learning techniques were applied to predict data quality under different conditions. The comparison showed a similar pattern and only a slight difference between the datasets, which verified the quality of Waze speed data. The statistical model indicates that Waze speed data are more accurate in peak hours than in night hours. Also, traffic speed, traffic volume, and segment length have a significant association with the accuracy of Waze data on surface streets. Finally, the machine learning results showed that a KNN method achieved the highest prediction accuracy, 84.5% and 82.9% for the training and test datasets, respectively. Overall, the study results suggest that Waze speed data are a promising data source for surface streets.
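The first two steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact metric definitions or the features passed to k-means, so the mean difference, mean absolute error, and mean absolute percentage error below, and the choice to cluster on absolute speed error, are assumptions. The function names (`compare_speeds`, `kmeans_1d`) and the sample speeds are hypothetical.

```python
from statistics import mean


def compare_speeds(waze, bluetooth):
    """Summarise paired speeds (same segment, same time bin). Assumed metrics."""
    if len(waze) != len(bluetooth) or not waze:
        raise ValueError("need two non-empty series of equal length")
    diffs = [w - b for w, b in zip(waze, bluetooth)]
    return {
        "mean_diff": mean(diffs),                   # signed bias of Waze vs. reference
        "mae": mean(abs(d) for d in diffs),         # mean absolute error
        "mape": mean(abs(d) / b for d, b in zip(diffs, bluetooth)),  # relative error
    }


def kmeans_1d(values, k=3, iterations=50):
    """Plain Lloyd's k-means on scalars; returns (centroids, labels)."""
    ordered = sorted(values)
    # Deterministic start: take seeds spread evenly across the sorted values.
    centroids = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    labels = [0] * len(values)
    for _ in range(iterations):
        # Assignment step: each value joins its nearest centroid.
        labels = [min(range(k), key=lambda c: abs(v - centroids[c])) for v in values]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            new_centroids.append(mean(members) if members else centroids[c])
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, labels


# Hypothetical paired speeds in mph (made-up numbers, not from the study).
waze = [30, 32, 28, 40, 25, 35]
bluetooth = [31, 30, 29, 30, 26, 34]

summary = compare_speeds(waze, bluetooth)
abs_errors = [abs(w - b) for w, b in zip(waze, bluetooth)]
centroids, labels = kmeans_1d(abs_errors, k=2)
```

Each label marks a quality group; with `k=2` on these made-up numbers, the single observation with a large error ends up in its own cluster. The labels could then serve as the response variable for a classifier such as the KNN model mentioned in the abstract.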
Pages: 14