Multinomial goodness-of-fit tests for logistic regression models

Times cited: 118
Authors
Fagerland, Morten W. [1]
Hosmer, David W. [2]
Bofin, Anna M. [3]
Affiliations
[1] Ullevaal Univ Hosp, Clin Res Ctr, N-0407 Oslo, Norway
[2] Univ Vermont, Dept Math & Stat, Burlington, VT 05405 USA
[3] Norwegian Univ Sci & Technol, Fac Med, Dept Lab Med, N-7034 Trondheim, Norway
Keywords
logistic regression; goodness-of-fit; multinomial regression; generalized linear models; simulations;
DOI
10.1002/sim.3202
Chinese Library Classification (CLC) number
Q [Biological sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract
We examine the properties of several goodness-of-fit tests for multinomial logistic regression. One test is based on sorting the observations according to the complement of the estimated probability for the reference outcome category and then grouping the subjects into g equal-sized groups. A g × c contingency table, where c is the number of values of the outcome variable, is constructed. The test statistic, denoted C_g, is obtained by calculating the Pearson χ² statistic, where the estimated expected frequencies are the sums of the model-based estimated logistic probabilities. Simulations compare the properties of C_g with those of the ungrouped Pearson χ² test (X²) and its normalized test (z). The null distribution of C_g is well approximated by the χ² distribution with (g − 2) × (c − 1) degrees of freedom. The sampling distribution of X² is compared with a χ² distribution with n × (c − 1) degrees of freedom but shows erratic behavior. With a few exceptions, the sampling distribution of z adheres reasonably well to the standard normal distribution. Power simulations show that C_g has low power for a sample of 100 observations, but satisfactory power for a sample of 400. The tests are illustrated using data from a study of cytological criteria for the diagnosis of breast tumors. Copyright (c) 2008 John Wiley & Sons, Ltd.
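The grouping strategy described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `cg_statistic`, the coding of outcomes as integers 0..c-1 with category 0 as the reference, and the way near-equal group boundaries and ties are handled are all choices made here for the example. The abstract does not specify those details.

```python
def cg_statistic(y, probs, g=10):
    """Grouped Pearson chi-square statistic for a fitted multinomial model.

    y     : sequence of observed outcome codes 0..c-1 (0 = reference; assumed coding)
    probs : sequence of length-c rows of fitted probabilities, one row per subject
    g     : number of groups
    Returns (statistic, degrees_of_freedom).
    """
    n = len(y)
    c = len(probs[0])

    # Sort subjects by the complement of the fitted reference-category probability.
    # sorted() is stable, so ties keep their original order (an assumption here).
    order = sorted(range(n), key=lambda i: 1.0 - probs[i][0])

    stat = 0.0
    for k in range(g):
        # Near-equal group sizes via integer boundaries (hypothetical choice).
        idx = order[k * n // g:(k + 1) * n // g]
        for j in range(c):
            observed = sum(1 for i in idx if y[i] == j)
            # Expected count: sum of fitted probabilities within the group.
            expected = sum(probs[i][j] for i in idx)
            stat += (observed - expected) ** 2 / expected

    # Reference distribution reported in the abstract: (g - 2) x (c - 1) df.
    df = (g - 2) * (c - 1)
    return stat, df
```

A p-value could then be obtained from the upper tail of the χ² distribution with `df` degrees of freedom, for example with `scipy.stats.chi2.sf(stat, df)` if SciPy is available. The sketch divides by the expected count without guarding against zero, so groups with vanishing expected frequencies would need separate handling.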
Pages: 4238-4253
Number of pages: 16