The performance of different propensity score methods for estimating marginal hazard ratios

被引:689
作者
Austin, Peter C. [1 ,2 ,3 ]
机构
[1] Inst Clin Evaluat Sci, Toronto, ON M4N 3M5, Canada
[2] Univ Toronto, Inst Hlth Management Policy & Evaluat, Toronto, ON, Canada
[3] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
基金
加拿大健康研究院;
关键词
propensity score; survival analysis; inverse probability of treatment weighting (IPTW); Monte Carlo simulations; observational study; time-to-event outcomes; INVERSE PROBABILITY; UNTREATED SUBJECTS; RISK DIFFERENCES; MODELS; SUBCLASSIFICATION; ADJUSTMENT; BIAS;
D O I
10.1002/sim.5705
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Propensity score methods are increasingly being used to reduce or minimize the effects of confounding when estimating the effects of treatments, exposures, or interventions when using observational or non-randomized data. Under the assumption of no unmeasured confounders, previous research has shown that propensity score methods allow for unbiased estimation of linear treatment effects (e.g., differences in means or proportions). However, in biomedical research, time-to-event outcomes occur frequently. There is a paucity of research into the performance of different propensity score methods for estimating the effect of treatment on time-to-event outcomes. Furthermore, propensity score methods allow for the estimation of marginal or population-average treatment effects. We conducted an extensive series of Monte Carlo simulations to examine the performance of propensity score matching (1:1 greedy nearest-neighbor matching within propensity score calipers), stratification on the propensity score, inverse probability of treatment weighting (IPTW) using the propensity score, and covariate adjustment using the propensity score to estimate marginal hazard ratios. We found that both propensity score matching and IPTW using the propensity score allow for the estimation of marginal hazard ratios with minimal bias. Of these two approaches, IPTW using the propensity score resulted in estimates with lower mean squared error when estimating the effect of treatment in the treated. Stratification on the propensity score and covariate adjustment using the propensity score result in biased estimation of both marginal and conditional hazard ratios. Applied researchers are encouraged to use propensity score matching and IPTW using the propensity score when estimating the relative effect of treatment on time-to-event outcomes. Copyright (c) 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:2837 / 2849
页数:13
相关论文
共 37 条
[1]  
[Anonymous], 2002, Springer Series in Statistics
[2]   The performance of two data-generation processes for data with specified marginal treatment odds ratios [J].
Austin, Peter C. ;
Stafford, James .
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2008, 37 (06) :1039-1051
[3]  
Austin PC, 2008, STAT MED, V27, P2037, DOI 10.1002/sim.3150
[4]   The performance of different propensity-score methods for estimating relative risks [J].
Austin, Peter C. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2008, 61 (06) :537-545
[5]   The performance of different propensity score methods for estimating marginal odds ratios [J].
Austin, Peter C. .
STATISTICS IN MEDICINE, 2007, 26 (16) :3078-3094
[6]   A comparison of the ability of different propensity score models to balance measured variables between treated and untreated subjects: a Monte Carlo study [J].
Austin, Peter C. ;
Grootendorst, Paul ;
Anderson, Geoffrey M. .
STATISTICS IN MEDICINE, 2007, 26 (04) :734-753
[7]   Conditioning on the propensity score can result in biased estimation of common measures of treatment effect: A Monte Carlo study (p n/a) [J].
Austin, Peter C. ;
Grootendorst, Paul ;
Normand, Sharon-Lise T. ;
Anderson, Geoffrey M. .
STATISTICS IN MEDICINE, 2007, 26 (16) :3208-3210
[8]   Comparing paired vs non-paired statistical methods of analyses when making inferences about absolute risk reductions in propensity-score matched samples [J].
Austin, Peter C. .
STATISTICS IN MEDICINE, 2011, 30 (11) :1292-1301
[9]   Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies [J].
Austin, Peter C. .
PHARMACEUTICAL STATISTICS, 2011, 10 (02) :150-161
[10]   Statistical Criteria for Selecting the Optimal Number of Untreated Subjects Matched to Each Treated Subject When Using Many-to-One Matching on the Propensity Score [J].
Austin, Peter C. .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2010, 172 (09) :1092-1097