The Analysis of Count Data: A Gentle Introduction to Poisson Regression and Its Alternatives

被引:783
作者
Coxe, Stefany [1 ]
West, Stephen G. [1 ]
Aiken, Leona S. [1 ]
机构
[1] Arizona State Univ, Dept Psychol, Tempe, AZ 85287 USA
关键词
GENERALIZED LINEAR-MODELS;
D O I
10.1080/00223890802634175
中图分类号
B849 [应用心理学];
学科分类号
040203 ;
摘要
Count data reflect the number of occurrences of a behavior in a fixed period of time (e.g., number of aggressive acts by children during a playground period). In cases in which the outcome variable is a count with a low arithmetic mean (typically 10), standard ordinary least squares regression may produce biased results. We provide an introduction to regression models that provide appropriate analyses for count data. We introduce standard Poisson regression with an example and discuss its interpretation. Two variants of Poisson regression, overdispersed Poisson regression and negative binomial regression, are introduced that may provide more optimal results when a key assumption of standard Poisson regression is violated. We also discuss the problems of excess zeros in which a subgroup of respondents who would never display the behavior are included in the sample and truncated zeros in which respondents who have a zero count are excluded by the sampling plan. We provide computer syntax for our illustrations in SAS and SPSS. The Poisson family of regression models provides improved and now easy to implement analyses of count data. [Supplementary materials are available for this article. Go to the publisher's online edition of Journal of Personality Assessment for the following free supplemental resources: the data set used to illustrate Poisson regression in this article, which is available in three formatsa text file, an SPSS database, or a SAS database.].
引用
收藏
页码:121 / 136
页数:16
相关论文
共 39 条
[1]  
Aiken L.S., 1991, Multiple Regression: Testing and Interpreting Interactions
[2]  
Akaike H., 1973, 2 INT S INFORM THEOR, P267
[3]  
[Anonymous], 1999, Logistic regression using SAS: Theory and application
[4]  
[Anonymous], 2008, An Introduction to Generalized Linear Models
[5]  
[Anonymous], 1994, 9410 NEW YORK U DEP
[6]  
[Anonymous], 2018, Generalized linear models
[7]   Daily evaluation of anticipated outcomes from alcohol use among college students [J].
Armeli, S ;
Mohr, C ;
Todd, M ;
Maltby, N ;
Tennen, H ;
Carney, MA ;
Affleck, G .
JOURNAL OF SOCIAL AND CLINICAL PSYCHOLOGY, 2005, 24 (06) :767-792
[8]  
Cameron AC, 1997, J ECONOMETRICS, V77, P329
[9]  
Cameron Colin, 2013, Regression Analysis of Count Data
[10]   ON THE DISTRIBUTION OF THE LIKELIHOOD RATIO [J].
CHERNOFF, H .
ANNALS OF MATHEMATICAL STATISTICS, 1954, 25 (03) :573-578