Regression models for count data in R

被引:1822
作者
Zeileis, Achim [1 ]
Kleiber, Christian [2 ]
Jackman, Simon [3 ]
机构
[1] Vienna Univ Econ & Business Adm, Dept Math & Stat, A-1090 Vienna, Austria
[2] Univ Basel, CH-4003 Basel, Switzerland
[3] Stanford Univ, Stanford, CA 94305 USA
来源
JOURNAL OF STATISTICAL SOFTWARE | 2008年 / 27卷 / 08期
关键词
GLM; Poisson model; negative binomial model; hurdle model; zero-inflated model;
D O I
10.18637/jss.v027.i08
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The classical Poisson, geometric and negative binomial regression models for count data belong to the family of generalized linear models and are available at the core of the statistics toolbox in the R system for statistical computing. After reviewing the conceptual and computational features of these methods, a new implementation of hurdle and zero-inflated regression models in the functions hurdle () and zeroinfl () from the package pscl is introduced. It re-uses design and functionality of the basic R functions just as the underlying conceptual tools extend the classical models. Both hurdle and zero-in inflated model, are able to incorporate over-dispersion and excess zeros-two problems that typically occur in count data sets in economics and the social sciences-better than their classical counterparts. Using cross-section data on the demand for medical care, it is illustrated how the classical as well as the zero-augmented models can be fitted, inspected and tested in practice.
引用
收藏
页码:1 / 25
页数:25
相关论文
共 23 条
[1]  
Cameron A.C., 2005, MICROECONOMETRICS ME, DOI DOI 10.1017/CBO9780511811241
[2]  
Cameron AC, 1998, Regression Analysis of Count Data
[3]  
Chambers JM., 1992, Statistical models in S
[4]  
Deb P, 1997, J APPL ECONOMET, V12, P313, DOI 10.1002/(SICI)1099-1255(199705)12:3<313::AID-JAE440>3.0.CO
[5]  
2-G
[6]  
ERHARDT V, 2008, ZIGP ZERO INFLATED G
[7]  
Fox J., 2002, An R and S-Plus Companion to Applied Regression, DOI DOI 10.1016/J.CARBON.2010.02.029
[8]   The R Package geepack for Generalized Estimating Equations [J].
Halekoh, U ;
Hojsgaard, S ;
Yan, J .
JOURNAL OF STATISTICAL SOFTWARE, 2006, 15 (02) :1-11
[9]  
Jackman S., 2008, PSCL CLASSES METHODS
[10]  
Kleiber C, 2008, USE R, P1, DOI 10.1007/978-0-387-77318-6_1