Modeling motor vehicle crashes using Poisson-gamma models: Examining the effects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter

被引:249
作者
Lord, Dominique [1 ]
机构
[1] Texas A&M Univ, Zachry Dept Civil Engn, College Stn, TX 77843 USA
关键词
statistical models; Poisson-gamma; low sample mean values; empirical bayes; small sample size;
D O I
10.1016/j.aap.2006.02.001
中图分类号
TB18 [人体工程学];
学科分类号
1201 ;
摘要
There has been considerable research conducted on the development of statistical models for predicting crashes on highway facilities. Despite numerous advancements made for improving the estimation tools of statistical models, the most common probabilistic structure used for modeling motor vehicle crashes remains the traditional Poisson and Poisson-gamma (or Negative Binomial) distribution; when crash data exhibit overdispersion, the Poisson-gamma model is usually the model of choice most favored by transportation safety modelers. Crash data collected for safety studies often have the unusual attributes of being characterized by low sample mean values. Studies have shown that the goodness-of-fit of statistical models produced from such datasets can be significantly affected. This issue has been defined as the "low mean problem" (LMP). Despite recent developments on methods to circumvent the LMP and test the goodness-of-fit of models developed using such datasets, no work has so far examined how the LMP affects the fixed dispersion parameter of Poisson-gamma models used for modeling motor vehicle crashes. The dispersion parameter plays an important role in many types of safety studies and should, therefore, be reliably estimated. The primary objective of this research project was to verify whether the LMP affects the estimation of the dispersion parameter and, if it is, to determine the magnitude of the problem. The secondary objective consisted of determining the effects of an unreliably estimated dispersion parameter on common analyses performed in highway'safety studies. To accomplish the objectives of the study, a series of Poisson-gamma distributions were simulated using different values describing the mean, the dispersion parameter, and the sample size. Three estimators commonly used by transportation safety modelers for estimating the dispersion parameter of Poisson-gamma models were evaluated: the method of moments, the weighted regression, and the maximum likelihood method. In an attempt to complement the outcome of the simulation study, Poisson-gamma models were fitted to crash data collected in Toronto, Ont. characterized by a low sample mean and small sample size. The study shows that a low sample mean combined with a small sample size can seriously affect the estimation of the dispersion parameter, no matter which estimator is used within the estimation process. The probability the dispersion parameter becomes unreliably estimated increases significantly as the sample mean and sample size decrease. Consequently, the results show that an unreliably estimated dispersion parameter can significantly under-mine empirical Bayes (EB) estimates as well as the estimation of confidence intervals for the gamma mean and predicted response. The paper ends with recommendations about minimizing the likelihood of producing Poisson-gamma models with an unreliable dispersion parameter for modeling motor vehicle crashes. (c) 2006 Elsevier Ltd. All rights reserved.
引用
收藏
页码:751 / 766
页数:16
相关论文
共 65 条
[1]  
Abbess C., 1981, Traffic Engineering & Control, V22, P535
[2]  
Abdel-Aty M, 2004, TRANSPORT RES REC, P106
[3]  
[Anonymous], 1997, Observational Before/After Studies in Road Safety. Estimating the Effect of Highway and Traffic Engineering Measures on Road Safety
[4]  
ANSCOMBE FJ, 1950, BIOMETRIKA, V37, P358, DOI 10.1093/biomet/37.3-4.358
[5]  
Cameron A.C., 1986, Journal of Applied Econometrics, V1, P29, DOI [10.1002/jae.3950010104, DOI 10.1002/JAE.3950010104]
[6]   REGRESSION-BASED TESTS FOR OVERDISPERSION IN THE POISSON MODEL [J].
CAMERON, AC ;
TRIVEDI, PK .
JOURNAL OF ECONOMETRICS, 1990, 46 (03) :347-364
[7]  
Cameron AC, 1998, Regression Analysis of Count Data
[8]  
CAMERON AC, 2005, COMMUNICATION
[9]   ESTIMATION OF THE NEGATIVE BINOMIAL PARAMETER-KAPPA BY MAXIMUM QUASI-LIKELIHOOD [J].
CLARK, SJ ;
PERRY, JN .
BIOMETRICS, 1989, 45 (01) :309-316
[10]   VARIANCE FUNCTION ESTIMATION [J].
DAVIDIAN, M ;
CARROLL, RJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1987, 82 (400) :1079-1091