Predictive performance of the binary logit model in unbalanced samples

被引:115
作者
Cramer, JS [1 ]
机构
[1] Tinbergen Inst, Amsterdam, Netherlands
关键词
goodness of fit; logistic regression; predicted probabilities; unequal sample proportions;
D O I
10.1111/1467-9884.00173
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In a binary logit analysis with unequal sample frequencies of the two outcomes the less frequent outcome always has tower estimated prediction probabilities than the other outcome. This effect is unavoidable, and its extent varies inversely with the fit of the model, as given by a new measure that follows naturally from the argument. Unbalanced samples with a poor fit are typical for survey analyses in the social sciences and epidemiology, and there the difference in prediction probabilities is most acute. It affects two common diagnostics: the within-sample 'percentage correctly predicted' and the identification of outliers. Partial remedies are suggested.
引用
收藏
页码:85 / 94
页数:10
相关论文
共 17 条
[1]  
Afifi AA., 1990, COMPUTER AIDED MULTI
[2]  
BAKKER FM, 1993, EXP APPL ACAROL, V17, P97
[3]   DOSE AND VOLUME EFFECTS ON FIBROSIS AFTER BREAST-CONSERVATION THERAPY [J].
BORGER, JH ;
KEMPERMAN, H ;
SMITT, HS ;
HART, A ;
VANDONGEN, J ;
LEBESQUE, J ;
BARTELINK, H .
INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 1994, 30 (05) :1073-1081
[4]  
CRAMER JS, 1991, LOGIT MODEL INTRO EC
[5]  
CRAMER JS, 1997, 970444 TI
[6]   Predicting the criminal antecedents of a stranger rapist from his offence behaviour [J].
Davies, A ;
Wittebrod, K ;
Jackson, JL .
SCIENCE & JUSTICE, 1997, 37 (03) :161-170
[9]  
Finney D.J., 1977, PROBIT ANAL, VIII
[10]   SEMINONPARAMETRIC ESTIMATION OF BINARY-CHOICE MODELS WITH AN APPLICATION TO LABOR-FORCE PARTICIPATION [J].
GABLER, S ;
LAISNEY, F ;
LECHNER, M .
JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 1993, 11 (01) :61-80