Review of inverse probability weighting for dealing with missing data

被引:1228
作者
Seaman, Shaun R. [1 ]
White, Ian R. [1 ]
机构
[1] Inst Publ Hlth, MRC Biostat Unit, Cambridge CB2 0SR, England
关键词
Asymptotic efficiency; doubly robust; model misspecification; propensity score; DEMYSTIFYING DOUBLE ROBUSTNESS; MULTIPLE-IMPUTATION; CAUSAL INFERENCE; SEMIPARAMETRIC REGRESSION; ALTERNATIVE STRATEGIES; REPEATED OUTCOMES; PREDICTORS; SELECTION; MIDLIFE; NONRESPONSE;
D O I
10.1177/0962280210395740
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
100404 [儿少卫生与妇幼保健学];
摘要
The simplest approach to dealing with missing data is to restrict the analysis to complete cases, i.e. individuals with no missing values. This can induce bias, however. Inverse probability weighting (IPW) is a commonly used method to correct this bias. It is also used to adjust for unequal sampling fractions in sample surveys. This article is a review of the use of IPW in epidemiological research. We describe how the bias in the complete-case analysis arises and how IPW can remove it. IPW is compared with multiple imputation (MI) and we explain why, despite MI generally being more efficient, IPW may sometimes be preferred. We discuss the choice of missingness model and methods such as weight truncation, weight stabilisation and augmented IPW. The use of IPW is illustrated on data from the 1958 British Birth Cohort.
引用
收藏
页码:278 / 295
页数:18
相关论文
共 63 条
[1]
Early predictors of adult drinking: A birth cohort study [J].
Alati, R ;
Najman, JM ;
Kinner, SA ;
Mamun, AA ;
Williams, GM ;
O'Callaghan, M ;
Bor, W .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2005, 162 (11) :1098-1107
[2]
Predictors of follow-up and assessment of selection bias from dropouts using inverse probability weighting in a cohort of university graduates [J].
Alonso, Alvaro ;
Segui-Gomez, Maria ;
de Irala, Jokin ;
Sanchez-Villegas, Almudena ;
Beunza, Juan Jose ;
Martinez-Gonzalez, Miguel Angel .
EUROPEAN JOURNAL OF EPIDEMIOLOGY, 2006, 21 (05) :351-358
[3]
Loss and representativeness in a biomedical survey at age 45 years: 1958 British birth cohort [J].
Atherton, K. ;
Fuller, E. ;
Shepherd, P. ;
Strachan, D. P. ;
Power, C. .
JOURNAL OF EPIDEMIOLOGY AND COMMUNITY HEALTH, 2008, 62 (03) :216-223
[4]
The performance of different propensity-score methods for estimating relative risks [J].
Austin, Peter C. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2008, 61 (06) :537-545
[5]
Doubly robust estimation in missing data and causal inference models [J].
Bang, H .
BIOMETRICS, 2005, 61 (04) :962-972
[6]
A simulation study comparing weighted estimating equations with multiple imputation based estimating equations for longitudinal binary data [J].
Beunckens, Caroline ;
Sotto, Cristina ;
Molenberghs, Geert .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 52 (03) :1533-1548
[7]
Variable selection for propensity score models [J].
Brookhart, M. Alan ;
Schneeweiss, Sebastian ;
Rothman, Kenneth J. ;
Glynn, Robert J. ;
Avorn, Jerry ;
Sturmer, Til .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2006, 163 (12) :1149-1156
[8]
A semiparametric model selection criterion with applications to the marginal structural model [J].
Brookhart, MA ;
van der Laan, MJ .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (02) :475-498
[9]
Lifecourse socioeconomic predictors of midlife drinking patterns, problems and abstention: Findings from the 1958 British Birth Cohort Study [J].
Caldwell, T. M. ;
Rodgers, B. ;
Clark, C. ;
Jefferis, B. J. M. H. ;
Stansfeld, S. A. ;
Power, C. .
DRUG AND ALCOHOL DEPENDENCE, 2008, 95 (03) :269-278
[10]
Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete data [J].
Cao, Weihua ;
Tsiatis, Anastasios A. ;
Davidian, Marie .
BIOMETRIKA, 2009, 96 (03) :723-734