The use of resampling methods to simplify regression models in medical statistics

被引:241
作者
Sauerbrei, W [1 ]
机构
[1] Univ Freiburg, Inst Med Biometry & Med Informat, D-79104 Freiburg, Germany
关键词
backward elimination; bootstrap; cross-validation; model complexity; prediction; selection bias; selection level;
D O I
10.1111/1467-9876.00155
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The number of variables in a regression model is often too large and a more parsimonious model may be preferred. Selection strategies (e.g. all-subset selection with various penalties for model complexity, or stepwise procedures) are widely used, but there are few analytical results about their properties. The problems of replication stability, model complexity, selection bias and an overoptimistic estimate of the predictive value of a model are discussed together with several proposals based on resampling methods. The methods are applied to data from a case-control study on atopic dermatitis and a clinical trial to compare two chemotherapy regimes by using a logistic regression and a Cox model. A recent proposal to use shrinkage factors to reduce the bias of parameter estimates caused by model building is extended to parameterwise shrinkage factors and is discussed as a further possibility to illustrate problems of models which are too complex. The results from the resampling approaches favour greater simplicity of the final regression model.
引用
收藏
页码:313 / 329
页数:17
相关论文
共 31 条
[1]  
[Anonymous], 1990, SUBSET SELECTION REG, DOI DOI 10.1007/978-1-4899-2939-6
[2]   INFLUENCE OF MODEL-BUILDING STRATEGIES ON THE RESULTS OF A CASE-CONTROL STUDY [J].
BLETTNER, M ;
SAUERBREI, W .
STATISTICS IN MEDICINE, 1993, 12 (14) :1325-1338
[4]   BETTER SUBSET REGRESSION USING THE NONNEGATIVE GARROTE [J].
BREIMAN, L .
TECHNOMETRICS, 1995, 37 (04) :373-384
[5]   Model selection: An integral part of inference [J].
Buckland, ST ;
Burnham, KP ;
Augustin, NH .
BIOMETRICS, 1997, 53 (02) :603-618
[6]   MODEL UNCERTAINTY, DATA MINING AND STATISTICAL-INFERENCE [J].
CHATFIELD, C .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1995, 158 :419-466
[7]   THE BOOTSTRAP AND IDENTIFICATION OF PROGNOSTIC FACTORS VIA COX PROPORTIONAL HAZARDS REGRESSION-MODEL [J].
CHEN, CH ;
GEORGE, SL .
STATISTICS IN MEDICINE, 1985, 4 (01) :39-46
[8]  
COPAS JB, 1983, J R STAT SOC B, V45, P311
[9]  
COPAS JB, 1991, STATISTICIAN, V40, P51
[10]  
COX DR, 1972, J R STAT SOC B, V34, P187