Marginal analysis of incomplete longitudinal binary data: A cautionary note on LOCF imputation

被引:58
作者
Cook, RJ [1 ]
Zeng, LL [1 ]
Yi, GY [1 ]
机构
[1] Univ Waterloo, Dept Stat & Actuarial Sci, Waterloo, ON N2L 3G1, Canada
关键词
drop-outs; generalized estimating equations; imputation; longitudinal studies; missing data; misspecified models;
D O I
10.1111/j.0006-341X.2004.00234.x
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In recent years there has been considerable research devoted to the development of methods for the analysis of incomplete data in longitudinal studies. Despite these advances, the methods used in practice have changed relatively little, particularly in the reporting of pharmaceutical trials. In this setting, perhaps the most widely adopted strategy for dealing with incomplete longitudinal data is imputation by the "last observation carried forward" (LOCF) approach, in which values for missing responses are imputed using observations from the most recently completed assessment. We examine the asymptotic and empirical bias, the empirical type I error rate, and the empirical coverage probability associated with estimators and tests of treatment effect based on the LOCF imputation strategy. We consider a setting involving longitudinal binary data with longitudinal analyses based on generalized estimating equations, and an analysis based simply on the response at the end of the scheduled follow-up. We find that for both of these approaches, imputation by LOCF can lead to substantial biases in estimators of treatment effects, the type I error rates of associated tests can be greatly inflated, and the coverage probability can be far from the nominal level. Alternative analyses based on all available data lead to estimators with comparatively small bias, and inverse probability weighted analyses yield consistent estimators subject to correct specification of the missing data process. We illustrate the differences between various methods of dealing with drop-outs using data from a study of smoking behavior.
引用
收藏
页码:820 / 828
页数:9
相关论文
共 21 条
[1]   Effectiveness of a social influences smoking prevention program as a function of provider type, training method, and school risk [J].
Cameron, R ;
Brown, KS ;
Best, JA ;
Pelkman, CL ;
Madill, CL ;
Manske, SR ;
Payne, ME .
AMERICAN JOURNAL OF PUBLIC HEALTH, 1999, 89 (12) :1827-1831
[2]  
CROWDER M, 1995, BIOMETRIKA, V82, P407
[3]   MISSING DATA, IMPUTATION, AND THE BOOTSTRAP [J].
EFRON, B .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1994, 89 (426) :463-475
[4]  
FITZMAURICE GM, 1995, J ROY STAT SOC B MET, V57, P691
[5]   ANALYZING INCOMPLETE LONGITUDINAL BINARY RESPONSES - A LIKELIHOOD-BASED APPROACH [J].
FITZMAURICE, GM ;
LAIRD, NM ;
LIPSITZ, SR .
BIOMETRICS, 1994, 50 (03) :601-612
[6]   MISSING DATA IN LONGITUDINAL-STUDIES [J].
LAIRD, NM .
STATISTICS IN MEDICINE, 1988, 7 (1-2) :305-315
[7]  
LIANG KY, 1992, J R STAT SOC B, V54, P3
[8]  
LIANG KY, 1986, BIOMETRIKA, V73, P13, DOI 10.1093/biomet/73.1.13
[9]  
Lindsey JK, 1998, STAT MED, V17, P447, DOI 10.1002/(SICI)1097-0258(19980228)17:4<447::AID-SIM752>3.0.CO
[10]  
2-G