The impact of a misspecified random-effects distribution on the estimation and the performance of inferential procedures in generalized linear mixed models

被引:85
作者
Litiere, S. [1 ]
Alonso, A. [1 ]
Molenberghs, G. [1 ]
机构
[1] Hasselt Univ, Ctr Stat, BE-3590 Diepenbeek, Belgium
关键词
consistency; heterogeneity model; Kullback-Leibler information criterion; non-normal random effects; power; type I error;
D O I
10.1002/sim.3157
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Estimation in generalized linear mixed models (GLMMs) is often based on maximum likelihood theory, assuming that the underlying probability model is correctly specified. However, the validity of this assumption is sometimes difficult to verify. In this paper we study, through simulations, the impact of misspecifying the random-effects distribution on the estimation and hypothesis testing in GLMMs. It is shown that the maximum likelihood estimators are inconsistent in the presence of misspecification. The bias induced in the mean-structure parameters is generally small, as far as the variability of the underlying random-effects distribution is small as well. However, the estimates of this variability are always severely biased. Given that the variance components are the only tool to study the variability of the true distribution, it is difficult to assess whether problems in the estimation of the mean structure occur. The type I error rate and the power of the commonly used inferential procedures are also severely affected. The situation is aggravated if more than one random effect is included in the model. Further, we propose to deal with possible misspecification by way of sensitivity analysis, considering several random-effects distributions. All the results are illustrated using data from a clinical trial in schizophrenia. Copyright (C) 2007 John Wiley & Sons, Ltd.
引用
收藏
页码:3125 / 3144
页数:20
相关论文
共 27 条
[1]   Examples in which misspecification of a random effects distribution reduces efficiency, and possible remedies [J].
Agresti, A ;
Caffo, B ;
Ohman-Strickland, P .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 47 (03) :639-653
[2]   A general maximum likelihood analysis of variance components in generalized linear models [J].
Aitkin, M .
BIOMETRICS, 1999, 55 (01) :117-128
[3]  
Aitkin M, 1999, STAT MED, V18, P2343, DOI 10.1002/(SICI)1097-0258(19990915/30)18:17/18<2343::AID-SIM260>3.0.CO
[4]  
2-3
[5]   Validation of surrogate markers in multiple randomized clinical trials with repeated measurements: Canonical correlation approach [J].
Alonso, A ;
Geys, H ;
Molenberghs, G ;
Kenward, MG ;
Vangeneugden, T .
BIOMETRICS, 2004, 60 (04) :845-853
[6]  
[Anonymous], 2002, ANAL LONGITUDINAL DA
[7]  
Arends LR, 2000, STAT MED, V19, P3497, DOI 10.1002/1097-0258(20001230)19:24<3497::AID-SIM830>3.0.CO
[8]  
2-H
[9]  
Bohning D., 1999, MG STAT PRO, V81
[10]   A Bayesian semiparametric model for random-effects meta-analysis [J].
Burr, D ;
Doss, H .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (469) :242-251