Market basket analysis of crash data from large jurisdictions and its potential as a decision support tool

被引:103
作者
Pande, Anurag [1 ]
Abdel-Aty, Mohamed [1 ]
机构
[1] Univ Cent Florida, Dept Civil & Environm Engn, Orlando, FL 32816 USA
关键词
Association rules; Crash characteristics; Data mining; Traffic safety; SEVERITY LEVELS; ACCIDENTS;
D O I
10.1016/j.ssci.2007.12.001
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Data mining applications are becoming increasingly popular for many applications across a set of very divergent fields. Analysis of crash data is no exception. There are many data mining methodologies that have been applied to crash data in the recent past. However, one particular application conspicuously missing from the traffic safety literature until recently is association analysis or market basket analysis. The methodology is used by retailers all over the world to determine which items are purchased together. In this study, crashes, are analyzed its supermarket transactions to detect interdependence among crash characteristics. The results from the analysis include simple rules that indicate which crash characteristics are associated with each other. The application is demonstrated using non-intersection crash data from the state of Florida for the year 2004. In the proposed methodology no variable needs to be assigned as dependent variable. Hence, it is useful in identifying previously unknown patterns in the data obtained from large jurisdictions (such its the State of Florida) as opposed to the data from a single roadway or intersection. Based oil the association rules discovered from the analysis, it was concluded that there is it significant correlation between lack of illumination and high severity of crashes. Furthermore, it was found that under rainy conditions straight sections with vertical curves are particularly crash prone. Results are consistent with the understanding of crash characteristics and point to the potential of this methodology for the analysis of crash data collected by the state and federal agencies. The potential of this technique may be realized in the form of a decision support tool for the traffic safety administrators, (C) 2008 Published by Elsevier Ltd.
引用
收藏
页码:145 / 154
页数:10
相关论文
共 20 条
[1]   Exploring the overall and specific crash severity levels at signalized intersections [J].
Abdel-Aty, M ;
Keller, J .
ACCIDENT ANALYSIS AND PREVENTION, 2005, 37 (03) :417-425
[3]   Development of artificial neural network models to predict driver injury severity in traffic accidents at signalized intersections [J].
Abdelwahab, HT ;
Abdel-Aty, MA .
HIGHWAY SAFETY: MODELING, ANALYSIS, MANAGEMENT, STATISTICAL METHODS, AND CRASH LOCATION: SAFETY AND HUMAN PERFORMANCE, 2001, (1746) :6-13
[4]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[5]  
[Anonymous], P 1996 ACM SIGMOD IN
[6]  
[Anonymous], 2001, Traffic Safety Facts 2000
[7]   Older drivers and accidents: A meta analysis and data mining application on traffic accident data [J].
Bayam, E ;
Liebowitz, J ;
Agresti, W .
EXPERT SYSTEMS WITH APPLICATIONS, 2005, 29 (03) :598-629
[8]  
Bayardo R.J., 1999, Proc. of the Fifth ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining, P145, DOI [DOI 10.1145/312129.312219, 10.1145/3121312219]
[9]  
Brin S., 1997, SIGMOD Record, V26, P265, DOI [10.1145/253262.253325, 10.1145/253262.253327]
[10]   Analysis of freeway accident frequencies: Negative binomial regression versus artificial neural network [J].
Chang, LY .
SAFETY SCIENCE, 2005, 43 (08) :541-557