Weaknesses of goodness-of-fit tests for evaluating propensity score models: the case of the omitted confounder

被引:69
作者
Weitzen, S
Lapane, KL
Toledano, AY
Hume, AL
Mor, V
机构
[1] Brown Univ, Dept Obstet & Gynecol, Providence, RI 02912 USA
[2] Brown Med Sch, Dept Community Hlth, Providence, RI USA
[3] Women & Infants Hosp Rhode Isl, Div Res, Dept Obstet & Gynecol, Providence, RI USA
[4] Brown Med Sch, Ctr Stat Sci, Providence, RI USA
[5] Univ Rhode Isl, Dept Pharm Practice, Kingston, RI USA
[6] Brown Med Sch, Dept Family Med, Providence, RI USA
关键词
propensity score; logistic regression; goodness-of-fit; c-statistic; residual confounding;
D O I
10.1002/pds.986
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Purpose Propensity scores are used in observational studies to adjust for confounding, although they do not provide control for confounders omitted from the propensity score model. We sought to determine if tests used to evaluate logistic model fit and discrimination would be helpful in detecting the omission of an important confounder in the propensity score. Methods Using simulated data, we estimated propensity scores under two scenarios: (1) including all confounders and (2) omitting the binary confounder. We compared the propensity score model fit and discrimination under each scenario, using the Hosmer-Lemeshow goodness-of-fit (GOF) test and the c-statistic. We measured residual confounding in treatment effect estimates adjusted by the propensity score omitting the confounder. Results The GOF statistic and discrimination of propensity score models were the same for models excluding an important predictor of treatment compared to the full propensity score model. The GOF test failed to detect poor model fit for the propensity score model omitting the confounder. C-statistics under both scenarios were similar. Residual confounding was observed from using the propensity score excluding the confounder (range: 1-30%). Conclusions Omission of important confounders from the propensity score leads to residual confounding in estimates of treatment effect. However, tests of GOF and discrimination do not provide information to detect missing confounders in propensity score models. Our findings suggest that it may not be necessary to compute GOF statistics or model discrimination when developing propensity score models. Copyright (c) 2004 John Wiley & Sons, Ltd.
引用
收藏
页码:227 / 238
页数:12
相关论文
共 33 条
[1]  
[Anonymous], 1996, Multivariable Analysis: An Introduction
[2]   Logistic regression in the medical literature: Standards for use and reporting, with particular attention to one medical domain [J].
Bagley, SC ;
White, H ;
Golomb, BA .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2001, 54 (10) :979-985
[3]  
Bertolini G, 2000, J Epidemiol Biostat, V5, P251
[4]   Rare outcomes, common treatments: Analytic strategies using propensity scores [J].
Braitman, LE ;
Rosenbaum, PR .
ANNALS OF INTERNAL MEDICINE, 2002, 137 (08) :693-695
[5]   Comparison of logistic regression versus propensity score when the number of events is low and there are multiple confounders [J].
Cepeda, MS ;
Boston, R ;
Farrar, JT ;
Strom, BL .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2003, 158 (03) :280-287
[6]   The effectiveness of right heart catheterization in the initial care of critically ill patients [J].
Connors, AF ;
Speroff, T ;
Dawson, NV ;
Thomas, C ;
Harrell, FE ;
Wagner, D ;
Desbiens, N ;
Goldman, L ;
Wu, AW ;
Califf, RM ;
Fulkerson, WJ ;
Vidaillet, H ;
Broste, S ;
Bellamy, P ;
Lynn, J ;
Knaus, WA .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1996, 276 (11) :889-897
[7]  
D'Agostino RB, 1998, STAT MED, V17, P2265, DOI 10.1002/(SICI)1097-0258(19981015)17:19<2265::AID-SIM918>3.0.CO
[8]  
2-B
[9]   EFFECTS OF MISSPECIFICATION OF THE PROPENSITY SCORE ON ESTIMATORS OF TREATMENT EFFECT [J].
DRAKE, C .
BIOMETRICS, 1993, 49 (04) :1231-1236
[10]   MODELING AND VARIABLE SELECTION IN EPIDEMIOLOGIC ANALYSIS [J].
GREENLAND, S .
AMERICAN JOURNAL OF PUBLIC HEALTH, 1989, 79 (03) :340-349