Understanding Transportation Modes Based on GPS Data for Web Applications

被引:459
作者
Zheng, Yu [1 ]
Chen, Yukun [2 ]
Li, Quannan [3 ]
Xie, Xing [1 ]
Ma, Wei-Ying [1 ]
机构
[1] Microsoft Res Asia, Beijing 100190, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci, Beijing 100084, Peoples R China
[3] Huazhong Univ Sci & Technol, Wuhan 430074, Peoples R China
关键词
Algorithm; Design; Experimentation; Spatial data mining; GPS trajectory; ubiquitous computing; understanding user behavior; GeoLife; user mobility; transportation modes;
D O I
10.1145/1658373.1658374
中图分类号
TP [自动化技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
User mobility has given rise to a variety of Web applications, in which the global positioning system (GPS) plays many important roles in bridging between these applications and end users. As a kind of human behavior, transportation modes, such as walking and driving, can provide pervasive computing systems with more contextual information and enrich a user's mobility with informative knowledge. In this article, we report on an approach based on supervised learning to automatically infer users' transportation modes, including driving, walking, taking a bus and riding a bike, from raw GPS logs. Our approach consists of three parts: a change point-based segmentation method, an inference model and a graph-based post-processing algorithm. First, we propose a change point-based segmentation method to partition each GPS trajectory into separate segments of different transportation modes. Second, from each segment, we identify a set of sophisticated features, which are not affected by differing traffic conditions ( e. g., a person's direction when in a car is constrained more by the road than any change in traffic conditions). Later, these features are fed to a generative inference model to classify the segments of different modes. Third, we conduct graph-based postprocessing to further improve the inference performance. This postprocessing algorithm considers both the commonsense constraints of the real world and typical user behaviors based on locations in a probabilistic manner. The advantages of our method over the related works include three aspects. ( 1) Our approach can effectively segment trajectories containing multiple transportation modes. ( 2) Our work mined the location constraints from user-generated GPS logs, while being independent of additional sensor data and map information like road networks and bus stops. ( 3) The model learned from the dataset of some users can be applied to infer GPS data from others. Using the GPS logs collected by 65 people over a period of 10 months, we evaluated our approach via a set of experiments. As a result, based on the change-point-based segmentation method and Decision Tree-based inference model, we achieved prediction accuracy greater than 71 percent. Further, using the graph-based post-processing algorithm, the performance attained a 4-percent enhancement.
引用
收藏
页数:36
相关论文
共 22 条
[1]
Ankerst M, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P49
[2]
[Anonymous], 2000, SURVEY CONTEXT AWARE
[3]
[Anonymous], P 5 INT C UB COMP OC
[4]
Using GPS to learn significant locations and predict movement across multiple users [J].
Ashbrook, Daniel ;
Starner, Thad .
PERSONAL AND UBIQUITOUS COMPUTING, 2003, 7 (05) :275-286
[5]
Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1023/A:1018054314350
[6]
Counts S., 2007, P 15 ANN ACM INT S A, P10
[7]
Detection of daily activities and sports with wearable sensors in controlled and uncontrolled conditions [J].
Ermes, Miikka ;
Parkka, Juha ;
Mantyjarvi, Jani ;
Korhonen, Ilkka .
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2008, 12 (01) :20-26
[8]
Hariharan R, 2004, LECT NOTES COMPUT SC, V3234, P106
[9]
Krumm J, 2004, PROCEEDINGS OF MOBIQUITOUS 2004, P4
[10]
Krumm J, 2006, LECT NOTES COMPUT SC, V4206, P243