DUMMY VARIABLES IN STEPWISE REGRESSION

Cited by: 32
Author
COHEN, A
Affiliation
Keywords
CATEGORICAL REGRESSORS; FORWARD SELECTION;
DOI
10.2307/2684296
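Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification code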
020208 ; 070103 ; 0714 ;
Abstract
This note discusses a problem that might occur when forward stepwise regression is used for variable selection and among the candidate variables is a categorical variable with more than two categories. Most software packages (such as SAS, SPSS(X), BMDP) include special programs for performing stepwise regression. The user of these programs has to code categorical variables with dummy variables. In this case the forward selection might wrongly indicate that a categorical variable with more than two categories is nonsignificant. This is a disadvantage of the forward selection compared with the backward elimination method. A way to avoid the problem would be to test in a single step all dummy variables corresponding to the same categorical variable rather than one dummy variable at a time, such as in the analysis of covariance. This option, however, is not available in forward stepwise procedures, except for stepwise logistic regression in BMDP. A practical possibility is to repeat the forward stepwise regression and change the reference categories each time.
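The effect described in the abstract can be illustrated with a minimal sketch. It assumes NumPy and a small synthetic data set constructed for this purpose (not data from the note): a three-level categorical variable in which the small category `c` differs from the two large categories `a` and `b`. The helper names `dummy_code`, `rss`, `partial_f`, and `first_step_f` are hypothetical. The code compares the first-step F statistic of each dummy variable entered alone with the F statistic for entering all dummies of the categorical variable in a single step, under two choices of reference category.

```python
import numpy as np

def dummy_code(labels, reference):
    """Return (level names, 0/1 dummy matrix), omitting the reference category."""
    levels = [lv for lv in sorted(set(labels)) if lv != reference]
    cols = np.array([[1.0 if lab == lv else 0.0 for lv in levels] for lab in labels])
    return levels, cols

def rss(X, y):
    """Residual sum of squares from OLS of y on an intercept plus the columns of X."""
    design = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    resid = y - design @ beta
    return float(resid @ resid)

def partial_f(X_reduced, X_full, y):
    """Partial F statistic for the columns in X_full that are not in X_reduced."""
    q = X_full.shape[1] - X_reduced.shape[1]      # number of added regressors
    df_resid = len(y) - X_full.shape[1] - 1       # residual df of the full model
    rss_r, rss_f = rss(X_reduced, y), rss(X_full, y)
    return ((rss_r - rss_f) / q) / (rss_f / df_resid)

def first_step_f(labels, y, reference):
    """F-to-enter of each dummy alone, and of all dummies entered together."""
    names, D = dummy_code(labels, reference)
    none = np.empty((len(y), 0))                  # intercept-only starting model
    single = {name: partial_f(none, D[:, [j]], y) for j, name in enumerate(names)}
    joint = partial_f(none, D, y)
    return single, joint

# Synthetic data (an assumption for illustration): category c has mean 2,
# categories a and b have mean 0; the noise alternates +1/-1 within each group.
labels = ["c"] * 10 + ["a"] * 40 + ["b"] * 40
means = np.array([2.0 if lab == "c" else 0.0 for lab in labels])
y = means + np.tile([1.0, -1.0], 45)

single_ref_c, joint_ref_c = first_step_f(labels, y, reference="c")
single_ref_a, joint_ref_a = first_step_f(labels, y, reference="a")

print("reference c: single-dummy F =", single_ref_c, " joint F =", joint_ref_c)
print("reference a: single-dummy F =", single_ref_a, " joint F =", joint_ref_a)
```

On this constructed data set, with `c` as the reference category each dummy entered alone has an F statistic of about 2.6, so a forward procedure with a conventional F-to-enter threshold near 4 would admit neither, while the joint statistic for both dummies is about 17.2. The joint statistic does not depend on the reference category; with `a` as the reference, the dummy for `c` alone reaches about 34.8, which is why rerunning the selection with different reference categories can expose the effect.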
Pages: 226-228
Number of pages: 3