Parameter estimation and goodness-of-fit in log binomial regression

被引:121
作者
Blizzard, L
Hosmer, DW
机构
[1] Univ Tasmania, Menzies Res Inst, Hobart, Tas 7001, Australia
[2] Univ Massachusetts, Dept Publ Hlth, Amherst, MA 01003 USA
关键词
binary regression; odds ratio; risk ratio; prevalence ratio; logistic regression;
D O I
10.1002/bimj.200410165
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
An estimate of the risk, adjusted for confounders, can be obtained from a fitted logistic regression model, but it substantially over-estimates when the outcome is not rare. The log binomial model, binomial errors and log link, is increasingly being used for this purpose. However this model's performance, goodness of fit tests and case-wise diagnostics have not been studied. Extensive simulations are used to compare the performance of the log binomial, a logistic regression based method proposed by Schouten et al. (1993) and a Poisson regression approach proposed by Zou (2004) and Carter, Lipsitz, and Tilley (2005). Log binomial regression resulted in "failure" rates (non-convergence, out-of-bounds predicted probabilities) as high as 59%. Estimates by the method of Schouten et al. (1993) produced fitted log binomial probabilities greater than unity in up to 19% of samples to which a log binomial model had been successfully fit and in up to 78% of samples when the log binomial model fit failed. Similar percentages were observed for the Poisson regression approach. Coefficient and standard error estimates from the three models were similar. Rejection rates for goodness of fit tests for log binomial fit were around 5%. Power of goodness of fit tests was modest when an incorrect logistic regression model was fit. Examples demonstrate the use of the methods. Uncritical use of the log binomial regression model is not recommended.
引用
收藏
页码:5 / 22
页数:18
相关论文
共 25 条
[1]  
[Anonymous], 2003, P 28 ANN SAS US GROU
[2]  
Bertolini G, 2000, J Epidemiol Biostat, V5, P251
[3]   Parental smoking and infant respiratory infection: How important is not smoking in the same room with the baby? [J].
Blizzard, L ;
Ponsonby, AL ;
Dwyer, T ;
Venn, A ;
Cochrane, JA .
AMERICAN JOURNAL OF PUBLIC HEALTH, 2003, 93 (03) :482-488
[4]   Quasi-likelihood estimation for relative risk regression models [J].
Carter, RE ;
Lipsitz, SR ;
Tilley, BC .
BIOSTATISTICS, 2005, 6 (01) :39-44
[5]  
EFRON B, 1978, BIOMETRIKA, V65, P457, DOI 10.1093/biomet/65.3.457
[6]   Model-based estimation of relative risks and other epidemiologic measures in studies of common outcomes and in case-control studies [J].
Greenland, S .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2004, 160 (04) :301-305
[7]   INTERPRETATION AND CHOICE OF EFFECT MEASURES IN EPIDEMIOLOGIC ANALYSES [J].
GREENLAND, S .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1987, 125 (05) :761-768
[8]   GOODNESS OF FIT TESTS FOR THE MULTIPLE LOGISTIC REGRESSION-MODEL [J].
HOSMER, DW ;
LEMESHOW, S .
COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1980, 9 (10) :1043-1069
[9]  
Hosmer DW, 1997, STAT MED, V16, P965
[10]  
Hosmer W., 2000, Applied Logistic Regression, VSecond