Metrics for covariate balance in cohort studies of causal effects

被引:233
作者
Franklin, Jessica M. [1 ,2 ]
Rassen, Jeremy A. [1 ,2 ]
Ackermann, Diana [3 ]
Bartels, Dorothee B. [3 ,4 ]
Schneeweiss, Sebastian [1 ,2 ]
机构
[1] Brigham & Womens Hosp, Dept Med, Div Pharmacoepidemiol & Pharmacoecon, Boston, MA 02120 USA
[2] Harvard Univ, Sch Med, Boston, MA 02120 USA
[3] Boehringer Ingelheim GmbH & Co KG, Dept Global Epidemiol, Ingelheim, Germany
[4] Hannover Med Sch, Dept Epidemiol, Hannover, Germany
关键词
covariate balance; bias; confounding factors; matching; propensity score; MULTIVARIATE MATCHING METHODS; PROPENSITY-SCORE; CLAIMS DATA; BIAS; ADJUSTMENT; MODEL; SUBCLASSIFICATION; SELECTION; TRIALS;
D O I
10.1002/sim.6058
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Inferring causation from non-randomized studies of exposure requires that exposure groups can be balanced with respect to prognostic factors for the outcome. Although there is broad agreement in the literature that balance should be checked, there is confusion regarding the appropriate metric. We present a simulation study that compares several balance metrics with respect to the strength of their association with bias in estimation of the effect of a binary exposure on a binary, count, or continuous outcome. The simulations utilize matching on the propensity score with successively decreasing calipers to produce datasets with varying covariate balance. We propose the post-matching C-statistic as a balance metric and found that it had consistently strong associations with estimation bias, even when the propensity score model was misspecified, as long as the propensity score was estimated with sufficient study size. This metric, along with the average standardized difference and the general weighted difference, outperformed all other metrics considered in association with bias, including the unstandardized absolute difference, Kolmogorov-Smirnov and Levy distances, overlapping coefficient, Mahalanobis balance, and L-1 metrics. Of the best-performing metrics, the C-statistic and general weighted difference also have the advantage that they automatically evaluate balance on all covariates simultaneously and can easily incorporate balance on interactions among covariates. Therefore, when combined with the usual practice of comparing individual covariate means and standard deviations across exposure groups, these metrics may provide useful summaries of the observed covariate imbalance. Copyright (c) 2013 John Wiley & Sons, Ltd.
引用
收藏
页码:1685 / 1699
页数:15
相关论文
共 46 条
[1]   Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies [J].
Austin, Peter C. .
PHARMACEUTICAL STATISTICS, 2011, 10 (02) :150-161
[2]   Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples [J].
Austin, Peter C. .
STATISTICS IN MEDICINE, 2009, 28 (25) :3083-3107
[3]   Goodness-of-fit diagnostics for the propensity score model when estimating treatment effects using covariate adjustment with the propensity score [J].
Austin, Peter C. .
PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2008, 17 (12) :1202-1217
[4]   Assessing balance in measured baseline covariates when using many-to-one matching on the propensity-score [J].
Austin, Peter C. .
PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2008, 17 (12) :1218-1225
[5]   Measuring balance and model selection in propensity score methods [J].
Belitser, Svetlana V. ;
Martens, Edwin P. ;
Pestman, Wiebe R. ;
Groenwold, Rolf H. H. ;
de Boer, Anthonius ;
Klungel, Olaf H. .
PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2011, 20 (11) :1115-1129
[6]   EFFICIENCY OF MATCHED SAMPLES - AN EMPIRICAL INVESTIGATION [J].
BILLEWICZ, WZ .
BIOMETRICS, 1965, 21 (03) :623-+
[7]  
Bradley EL., 1985, ENCY STAT SCI, V6, P546
[8]   Variable selection for propensity score models [J].
Brookhart, M. Alan ;
Schneeweiss, Sebastian ;
Rothman, Kenneth J. ;
Glynn, Robert J. ;
Avorn, Jerry ;
Sturmer, Til .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2006, 163 (12) :1149-1156
[9]   Evaluating short-term drug effects using a physician-specific prescribing preference as an instrumental variable [J].
Brookhart, MA ;
Wang, PS ;
Solomon, DH ;
Schneeweiss, S .
EPIDEMIOLOGY, 2006, 17 (03) :268-275
[10]   EFFECTIVENESS OF ADJUSTMENT BY SUBCLASSIFICATION IN REMOVING BIAS IN OBSERVATIONAL STUDIES [J].
COCHRAN, WG .
BIOMETRICS, 1968, 24 (02) :295-&