Methods for categorizing a prognostic variable in a multivariable setting

被引:102
作者
Mazumdar, M [1 ]
Smith, A [1 ]
Bacik, J [1 ]
机构
[1] Mem Sloan Kettering Canc Ctr, Dept Epidemiol & Biostat, New York, NY 10021 USA
关键词
split-sample approach; two-fold cross-validation approach; log-likelihood statistic; multivariable setting; categorization;
D O I
10.1002/sim.1333
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The literature is filled with examples of categorization of a continuous prognostic variable in a univariable setting followed by the addition of this categorical variable to an existing multivariable model. Typically, an 'optimal' cutpoint for a new prognostic variable is obtained through a systematic search relating the variable to the outcome in an univariable manner. The corresponding categorical variable is then fitted in a multivariable model along with other already established prognostic covariates to assess the additional value of the new variable. This prompts the question whether the cutpoint search should have been performed in the same multivariable setting where it will ultimately be used. In this paper, we extend the univariable cutpoint search methods (split-sample approach and two-fold cross-validation approach) to the multivariable setting using -2 x log-likelihood statistic as the correlative measure. A Monte Carlo simulation study demonstrates that both methods are more efficient in detecting the true cutpoint and in estimating the effect size under the multivariable setting as opposed to the univariable setting. The cross-validation method performs better than the split-sample method in univariable as well as multivariable scenarios. For the cross-validation method in the multivariable setting, there is still a substantial loss of power when a cutpoint model is used in cases where there is a continuous relationship between the covariate and the outcome. An example is provided to illustrate the value of the multivariable cross-validation approach. Copyright (C) 2003 John Wiley Sons, Ltd.
引用
收藏
页码:559 / 571
页数:13
相关论文
共 22 条
[1]   DANGERS OF USING OPTIMAL CUTPOINTS IN THE EVALUATION OF PROGNOSTIC FACTORS [J].
ALTMAN, DG ;
LAUSEN, B ;
SAUERBREI, W ;
SCHUMACHER, M .
JOURNAL OF THE NATIONAL CANCER INSTITUTE, 1994, 86 (11) :829-835
[2]  
COX DR, 1990, ANAL SURVIVAL DATA, P91
[3]  
Faraggi D, 1996, STAT MED, V15, P2203, DOI 10.1002/(SICI)1097-0258(19961030)15:20<2203::AID-SIM357>3.3.CO
[4]  
2-7
[5]   WHY DO SO MANY PROGNOSTIC FACTORS FAIL TO PAN OUT [J].
HILSENBECK, SG ;
CLARK, GM ;
MCGUIRE, WL .
BREAST CANCER RESEARCH AND TREATMENT, 1992, 22 (03) :197-206
[6]  
Hilsenbeck SG, 1996, STAT MED, V15, P103, DOI 10.1002/(SICI)1097-0258(19960115)15:1<103::AID-SIM156>3.0.CO
[7]  
2-Y
[8]   Hydrocortisone with or without mitoxantrone in men with hormone-refractory prostate cancer: Results of the Cancer and Leukemia Group B 9182 study [J].
Kantoff, PW ;
Halabi, S ;
Conaway, M ;
Picus, J ;
Kirshner, J ;
Hars, V ;
Trump, D ;
Winer, EP ;
Vogelzang, NJ .
JOURNAL OF CLINICAL ONCOLOGY, 1999, 17 (08) :2506-2513
[9]   Evaluating the effect of optimized cutoff values in the assessment of prognostic factors [J].
Lausen, B ;
Schumacher, M .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1996, 21 (03) :307-326
[10]   MAXIMALLY SELECTED RANK STATISTICS [J].
LAUSEN, B ;
SCHUMACHER, M .
BIOMETRICS, 1992, 48 (01) :73-85