THE PROBLEM OF UNDERESTIMATING THE RESIDUAL ERROR VARIANCE IN FORWARD STEPWISE REGRESSION

被引:30
作者
FREEDMAN, LS
PEE, D
MIDTHUNE, DN
机构
[1] NCI,DIV CANC PREVENT & CONTROL,BIOMETRY BRANCH,BETHESDA,MD 20892
[2] INFORMAT MANAGEMENT SERV INC,ROCKVILLE,MD 20852
[3] INFORMAT MANAGEMENT SERV INC,SILVER SPRING,MD 20904
来源
STATISTICIAN | 1992年 / 41卷 / 04期
关键词
D O I
10.2307/2349005
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Under the global null hypothesis that all covariates are unrelated to the outcome variables, forward stepwise regression procedures should have the property that the probability of selecting a given variable and finding it significant at the alpha level is equal to alpha. Because of the problem of underestimating the residual error variance the actual probability can be very different from alpha. This problem becomes of practical concern when the ratio of the number of variables to the number of observations becomes greater than 0.25, and is more serious for logistic than for linear regression.
引用
收藏
页码:405 / 412
页数:8
相关论文
共 15 条