Practical and statistical issues in missing data for longitudinal patient-reported outcomes

Cited by: 210
Authors
Bell, Melanie L. [1 ]
Fairclough, Diane L. [2 ]
Affiliations
[1] Univ Sydney, Psychooncol Cooperat Res Grp PoCoG, Sydney, NSW 2006, Australia
[2] Colorado Sch Publ Hlth, Dept Biostat & Informat, Aurora, CO USA
Keywords
Missing data; maximum likelihood estimation; generalized estimating equations; multiple imputation; quality of life; patient reported outcomes; cancer; QUALITY-OF-LIFE; GENERALIZED ESTIMATING EQUATIONS; DOUBLY ROBUST ESTIMATION; RESEARCH DESIGN-PROBLEMS; CANCER CLINICAL-TRIALS; MULTIPLE-IMPUTATION; RANDOMIZED-TRIALS; FUNCTIONAL ASSESSMENT; JOINT ANALYSIS; DROP-OUT;
DOI
10.1177/0962280213476378
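Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services management)];
Subject classification code
Abstract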
Patient-reported outcomes are increasingly used in health research, including randomized controlled trials and observational studies. However, the validity of results in longitudinal studies can crucially hinge on the handling of missing data. This paper considers the issues of missing data at each stage of research. Practical strategies for minimizing missingness through careful study design and conduct are given. Statistical approaches that are commonly used, but should be avoided, are discussed, including how these methods can yield biased and misleading results. Methods that are valid for data which are missing at random are outlined, including maximum likelihood methods, multiple imputation and extensions to generalized estimating equations: weighted generalized estimating equations, generalized estimating equations with multiple imputation, and doubly robust generalized estimating equations. Finally, we discuss the importance of sensitivity analyses, including the role of missing not at random models, such as pattern mixture, selection, and shared parameter models. We demonstrate many of these concepts with data from a randomized controlled clinical trial on renal cancer patients, and show that the results are dependent on missingness assumptions and the statistical approach.
Pages: 440-459
Page count: 20
Related papers
84 in total
[71]   Doubly robust generalized estimating equations for longitudinal data [J].
Seaman, Shaun ;
Copas, Andrew .
STATISTICS IN MEDICINE, 2009, 28 (06) :937-955
[72]   Analysis and interpretation of results based on patient-reported outcomes [J].
Sloan, Jeff A. ;
Dueck, Amylou C. ;
Erickson, Pennifer A. ;
Guess, Harry ;
Revicki, Dennis A. ;
Santanello, Nancy C. .
VALUE IN HEALTH, 2007, 10 :S106-S115
[73]   Joint modelling of bivariate longitudinal data with informative dropout and left-censoring, with application to the evolution of CD4+cell count and HIV RNA viral load in response to treatment of HIV infection [J].
Thiébaut, R ;
Jacqmin-Gadda, H ;
Babiker, A ;
Commenges, D .
STATISTICS IN MEDICINE, 2005, 24 (01) :65-82
[74]   Quality of reporting of observational longitudinal research [J].
Tooth, L ;
Ware, R ;
Bain, C ;
Purdie, DM ;
Dobson, A .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2005, 161 (03) :280-288
[75]   BIASED-ESTIMATION OF THE ODDS RATIO IN CASE-CONTROL STUDIES DUE TO THE USE OF AD HOC METHODS OF CORRECTING FOR MISSING VALUES FOR CONFOUNDING VARIABLES [J].
VACH, W ;
BLETTNER, M .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1991, 134 (08) :895-907
[76]  
Van Buuren S, 1999, STAT MED, V18, P681, DOI 10.1002/(SICI)1097-0258(19990330)18:6<681::AID-SIM71>3.0.CO;2-R
[78]  
Verbeke G, 2009, SPRINGER SER STAT, P1
[79]   Shared parameter models for the joint analysis of longitudinal data and event times [J].
Vonesh, EF ;
Greene, T ;
Schluchter, MD .
STATISTICS IN MEDICINE, 2006, 25 (01) :143-163
[80]   Correction of bias from non-random missing longitudinal data using auxiliary information [J].
Wang, Cuiling ;
Hall, Charles B. .
STATISTICS IN MEDICINE, 2010, 29 (06) :671-679