Practical and statistical issues in missing data for longitudinal patient-reported outcomes

被引:210
作者
Bell, Melanie L. [1 ]
Fairclough, Diane L. [2 ]
机构
[1] Univ Sydney, Psychooncol Cooperat Res Grp PoCoG, Sydney, NSW 2006, Australia
[2] Colorado Sch Publ Hlth, Dept Biostat & Informat, Aurora, CO USA
关键词
Missing data; maximum likelihood estimation; generalized estimating equations; multiple imputation; quality of life; patient reported outcomes; cancer; QUALITY-OF-LIFE; GENERALIZED ESTIMATING EQUATIONS; DOUBLY ROBUST ESTIMATION; RESEARCH DESIGN-PROBLEMS; CANCER CLINICAL-TRIALS; MULTIPLE-IMPUTATION; RANDOMIZED-TRIALS; FUNCTIONAL ASSESSMENT; JOINT ANALYSIS; DROP-OUT;
D O I
10.1177/0962280213476378
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Patient-reported outcomes are increasingly used in health research, including randomized controlled trials and observational studies. However, the validity of results in longitudinal studies can crucially hinge on the handling of missing data. This paper considers the issues of missing data at each stage of research. Practical strategies for minimizing missingness through careful study design and conduct are given. Statistical approaches that are commonly used, but should be avoided, are discussed, including how these methods can yield biased and misleading results. Methods that are valid for data which are missing at random are outlined, including maximum likelihood methods, multiple imputation and extensions to generalized estimating equations: weighted generalized estimating equations, generalized estimating equations with multiple imputation, and doubly robust generalized estimating equations. Finally, we discuss the importance of sensitivity analyses, including the role of missing not at random models, such as pattern mixture, selection, and shared parameter models. We demonstrate many of these concepts with data from a randomized controlled clinical trial on renal cancer patients, and show that the results are dependent on missingness assumptions and the statistical approach.
引用
收藏
页码:440 / 459
页数:20
相关论文
共 84 条
[41]   Tutorial in biostatistics - Handling drop-out in longitudinal studies [J].
Hogan, JW ;
Roy, J ;
Korkontzelou, C .
STATISTICS IN MEDICINE, 2004, 23 (09) :1455-1497
[42]   Much ado about nothing: A comparison of missing data methods and software to fit incomplete data regression models [J].
Horton, Nicholas J. ;
Kleinman, Ken P. .
AMERICAN STATISTICIAN, 2007, 61 (01) :79-90
[43]   GOODNESS OF FIT TESTS FOR THE MULTIPLE LOGISTIC REGRESSION-MODEL [J].
HOSMER, DW ;
LEMESHOW, S .
COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1980, 9 (10) :1043-1069
[44]   Using auxiliary data for parameter estimation with non-ignorably missing outcomes [J].
Ibrahim, JG ;
Lipsitz, SR ;
Horton, N .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2001, 50 :361-373
[45]  
Ibrahim JG, 2009, TEST-SPAIN, V18, P1, DOI 10.1007/s11749-009-0138-x
[46]   Use of Missing Data Methods in Longitudinal Studies: The Persistence of Bad Practices in Developmental Psychology [J].
Jelicic, Helena ;
Phelps, Erin ;
Lerner, Richard A. .
DEVELOPMENTAL PSYCHOLOGY, 2009, 45 (04) :1195-1199
[47]   Demystifying double robustness: A comparison of alternative strategies for estimating a population mean from incomplete data [J].
Kang, Joseph D. Y. ;
Schafer, Joseph L. .
STATISTICAL SCIENCE, 2007, 22 (04) :523-539
[48]   Small sample inference for fixed effects from restricted maximum likelihood [J].
Kenward, MG ;
Roger, JH .
BIOMETRICS, 1997, 53 (03) :983-997
[49]   Longitudinal Data with Follow-up Truncated by Death: Match the Analysis Method to Research Aims [J].
Kurland, Brenda F. ;
Johnson, Laura L. ;
Egleston, Brian L. ;
Diehr, Paula H. .
STATISTICAL SCIENCE, 2009, 24 (02) :211-222
[50]  
LIANG KY, 1986, BIOMETRIKA, V73, P13, DOI 10.1093/biomet/73.1.13