Multivariable regression model building by using fractional polynomials: Description of SAS, STATA and R programs

被引:278
作者
Sauerbrei, W.
Meier-Hirmer, C.
Benner, A.
Royston, P.
机构
[1] Univ Hosp Freiburg, Inst Med Biometry & Med Informat, D-79104 Freiburg, Germany
[2] SNCF, Paris, France
[3] Deutsch Krebsforschungszentrum, D-6900 Heidelberg, Germany
[4] MRC, Clin Trials Unit, London, England
基金
英国医学研究理事会;
关键词
multivariable model building; function selection; fractional polynomials; programs;
D O I
10.1016/j.csda.2005.07.015
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In fitting regression models data analysts are often faced with many predictor variables which may influence the outcome. Several strategies for selection of variables to identify a subset of 'important, predictors are available for many years. A further issue to model building is how to deal with nonlinearity in the relationship between outcome and a continuous predictor. Traditionally, for such predictors either a linear functional relationship or a step function after grouping is assumed. However, the assumption of linearity may be incorrect, leading to a misspecified final model. For multivariable model building a systematic approach to investigate possible non-linear functional relationships based on fractional polynomials and the combination with backward elimination was proposed recently. So far a program was only available in Stata, certainly preventing a more general application of this useful procedure. The approach will be introduced, advantages will be shown in two examples, a new approach to present FP functions will be illustrated and a macro in SAS will be shortly introduced. Differences to Stata and R programs are noted. (C) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:3464 / 3485
页数:22
相关论文
共 30 条
[1]   DANGERS OF USING OPTIMAL CUTPOINTS IN THE EVALUATION OF PROGNOSTIC FACTORS [J].
ALTMAN, DG ;
LAUSEN, B ;
SAUERBREI, W ;
SCHUMACHER, M .
JOURNAL OF THE NATIONAL CANCER INSTITUTE, 1994, 86 (11) :829-835
[2]   Fractional polynomial model selection procedures: Investigation of type I error rate [J].
Ambler, G ;
Royston, P .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2001, 69 (01) :89-108
[3]  
[Anonymous], 1990, SUBSET SELECTION REG, DOI DOI 10.1007/978-1-4899-2939-6
[4]  
[Anonymous], STATA TECHNICAL B
[5]   Inflation of the type I error rate when a continuous confounding variable is categorized in logistic regression analyses [J].
Austin, PC ;
Brunner, LJ .
STATISTICS IN MEDICINE, 2004, 23 (07) :1159-1178
[6]   THE CONCEPT OF RESIDUAL CONFOUNDING IN REGRESSION-MODELS AND SOME APPLICATIONS [J].
BECHER, H .
STATISTICS IN MEDICINE, 1992, 11 (13) :1747-1758
[7]   IMPROPER USE OF STATISTICAL SIGNIFICANCE TESTING IN STUDYING COVARIABLES [J].
DALES, LG ;
URY, HK .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 1978, 7 (04) :373-375
[8]   Estimating the functional form of a continuous covariate's effect on survival time [J].
Holländer, N ;
Schumacher, M .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (04) :1131-1151
[9]  
JOHNSON KR, 1996, SOC ANIM, V4, P1
[10]   Evaluating the effect of optimized cutoff values in the assessment of prognostic factors [J].
Lausen, B ;
Schumacher, M .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1996, 21 (03) :307-326