Properties of R2 statistics for logistic regression

被引:40
作者
Hu, B
Palta, M
Shao, J
机构
[1] Univ Wisconsin, Dept Populat Hlth Sci, Madison, WI 53726 USA
[2] Univ Wisconsin, Dept Stat, Madison, WI 53706 USA
关键词
predictive value; confidence interval; misspecification;
D O I
10.1002/sim.2300
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Various R-2 statistics have been proposed for logistic regression to quantify the extent to which the binary response can be predicted by a given logistic regression model and covariates. We study the asymptotic properties of three popular variance-based R-2 statistics. We find that two variance-based R-2 statistics, the sum of squares and the squared Pearson correlation, have identical asymptotic distribution whereas the third one, Gini's concentration measure, has a different asymptotic behaviour and may overstate the predictivity of the model and covariates when the model is misspecified. Our result not only provides a theoretical basis for the findings in previous empirical and numerical work, but also leads to asymptotic confidence intervals. Statistical variability can then be taken into account when assessing the predictive value of a logistic regression model. Copyright (c) 2005 John Wiley & Sons, Ltd.
引用
收藏
页码:1383 / 1395
页数:13
相关论文
共 22 条
[1]  
[Anonymous], 1998, Applied regression analysis, DOI 10.1002/9781118625590
[2]  
Ash A, 1999, STAT MED, V18, P375, DOI 10.1002/(SICI)1097-0258(19990228)18:4<375::AID-SIM20>3.0.CO
[3]  
2-J
[4]  
CORDEIRO GM, 1991, J ROY STAT SOC B MET, V53, P629
[5]   ON THE INTERPRETATION AND USE OF R2 IN REGRESSION-ANALYSIS [J].
HELLAND, IS .
BIOMETRICS, 1987, 43 (01) :61-69
[6]  
Hosmer DW, 2004, APPL LOGISTIC REGRES
[7]  
HU B, UNPUB J ECONOMETRICS
[8]  
KENT JT, 1983, BIOMETRIKA, V70, P163, DOI 10.1093/biomet/70.1.163
[9]  
Long J., 1997, SCOTT REGRESSION MOD
[10]  
Mittlbock M, 1996, STAT MED, V15, P1987, DOI 10.1002/(SICI)1097-0258(19961015)15:19<1987::AID-SIM318>3.0.CO