Missing responses in generalised linear mixed models when the missing data mechanism is nonignorable

被引:169
作者
Ibrahim, JG
Chen, MH
Lipsitz, SR
机构
[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[2] Dana Farber Canc Inst, Boston, MA 02115 USA
[3] Worcester Polytech Inst, Dept Math Sci, Worcester, MA 01609 USA
[4] Med Univ S Carolina, Dept Biostat & Epidemiol, Charleston, SC 29425 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
EM algorithm; Gibbs sampling; maximum likelihood estimation; missing data mechanism; Monte Carlo EM; random effects model;
D O I
10.1093/biomet/88.2.551
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We propose a method for estimating parameters in the generalised linear mixed model with nonignorable missing response data and with nonmonotone patterns of missing data in the response variable. We develop a Monte Carlo EM algorithm for estimating the parameters in the model via the Gibbs sampler. For the normal random effects model, we derive a novel analytical form for the E- and M-steps, which is facilitated by integrating out the random effects. This form leads to a computationally feasible and extremely efficient Monte Carlo EM algorithm for computing maximum likelihood estimates and standard errors. In addition, we propose a very general joint multinomial model for the missing data indicators, which can be specified via a sequence of one-dimensional conditional distributions. This multinomial model allows for an arbitrary correlation structure between the missing data indicators, and has the potential of reducing the number of nuisance parameters. Real datasets from the International Breast Cancer Study Group and an environmental study involving dyspnoea in cotton workers are presented to illustrate the proposed methods.
引用
收藏
页码:551 / 564
页数:14
相关论文
共 31 条
[1]  
Albert J, 1995, BIOMETRIKA, V82, P747, DOI 10.1093/biomet/82.4.747
[2]  
[Anonymous], APPL STAT, DOI DOI 10.2307/2347565
[3]   MARGINAL REGRESSION FOR REPEATED BINARY DATA WITH OUTCOME SUBJECT TO NON-IGNORABLE NONRESPONSE [J].
BAKER, SG .
BIOMETRICS, 1995, 51 (03) :1042-1052
[4]  
Bernardo, 1996, BAYESIAN STAT, P165
[5]   Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo EM algorithm [J].
Booth, JG ;
Hobert, JP .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1999, 61 :265-285
[6]   APPROXIMATE INFERENCE IN GENERALIZED LINEAR MIXED MODELS [J].
BRESLOW, NE ;
CLAYTON, DG .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (421) :9-25
[7]  
Diggle P. G., 1994, J ROY STAT SOC C, V43, P49
[8]  
Diggle P. J., 2002, ANAL LONGITUDINAL DA
[9]   Multivariate logistic models for incomplete binary responses [J].
Fitzmaurice, GM ;
Laird, NM ;
Zahner, GEP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (433) :99-108
[10]   AN APPROXIMATE GENERALIZED LINEAR-MODEL WITH RANDOM EFFECTS FOR INFORMATIVE MISSING DATA [J].
FOLLMANN, D ;
WU, M .
BIOMETRICS, 1995, 51 (01) :151-168