Average Causal Effects From Nonrandomized Studies: A Practical Guide and Simulated Example

被引:362
作者
Schafer, Joseph L. [1 ,2 ]
Kang, Joseph [3 ]
机构
[1] Penn State Univ, Methodol Ctr, State Coll, PA 16801 USA
[2] Penn State Univ, Dept Stat, State Coll, PA 16801 USA
[3] Northwestern Univ, Feinberg Sch Med, Dept Prevent Med, Evanston, IL 60208 USA
关键词
nonequivalent control group design; propensity scores; Rubin's causal model;
D O I
10.1037/a0014268
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
In a well-designed experiment, random assignment of participants to treatments makes causal inference straightforward. However, if participants are not randomized (as in observational study, quasi-experiment, or nonequivalent control-group designs), group comparisons may be biased by confounders that influence both the outcome and the alleged cause. Traditional analysis of covariance, which includes confounders as predictors in a regression model, often fails to eliminate this bias. In this article, the authors review Rubin's definition of an average causal effect (ACE) as the average difference between potential outcomes under different treatments. The authors distinguish an ACE and a regression coefficient. The authors review 9 strategies for estimating ACEs on the basis of regression, propensity scores, and doubly robust methods, providing formulas for standard errors not given elsewhere. To illustrate the methods, the authors simulate an observational study to assess the effects of dieting on emotional distress. Drawing repeated samples from a simulated population of adolescent girls, the authors assess each method in terms of bias, efficiency, and interval coverage. Throughout the article, the authors offer insights and practical guidance for researchers who attempt causal inference with observational data.
引用
收藏
页码:279 / 313
页数:35
相关论文
共 141 条
[1]  
Agresti A., 2002, Categorical data analysis
[2]  
Aiken L. S., 1991, Multiple Regression: Testing and Interpreting Interactions
[3]   BAYESIAN-ANALYSIS OF BINARY AND POLYCHOTOMOUS RESPONSE DATA [J].
ALBERT, JH ;
CHIB, S .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (422) :669-679
[4]  
Allison PD, 2010, HANDBOOK OF SURVEY RESEARCH, 2ND EDITION, P631
[5]  
Angrist JD, 1996, J AM STAT ASSOC, V91, P444, DOI 10.2307/2291629
[6]  
[Anonymous], 2004, Handbook of data Analysis, DOI DOI 10.4135/9781848608184
[7]  
[Anonymous], HDB STAT MODELING SO, DOI [10.1007/978-1-4899-1292-3_1, DOI 10.1007/978-1-4899-1292-3_1]
[8]   Doubly robust estimation in missing data and causal inference models [J].
Bang, H .
BIOMETRICS, 2005, 61 (04) :962-972
[9]   ADJUSTING FOR DEMOGRAPHIC COVARIATES BY THE ANALYSIS OF COVARIANCE [J].
BERMAN, NC ;
GREENHOUSE, SW .
JOURNAL OF CLINICAL AND EXPERIMENTAL NEUROPSYCHOLOGY, 1992, 14 (06) :981-982
[10]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669