Effect of omitted confounders on the analysis of correlated binary data

被引:20
作者
Chao, WH [1 ]
Palta, M [1 ]
Young, T [1 ]
机构
[1] ACAD SINICA,INST STAT SCI,TAIPEI 11529,TAIWAN
关键词
confounding; generalized estimating equations; generalized linear models; longitudinal data; model misspecification; population averaged models;
D O I
10.2307/2533967
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Marginal analysis using the generalized estimating equation approach is widely applied to correlated observations, as occur in studies with clusters and in longitudinal follow-up of individuals. In this article, we investigate the effect of confounding in such models. We assume that a risk factor x and a confounder z are related by a generalized linear model to the outcome y, which can be binary or ordinal. In order to investigate confounding arising from the omission of z, a joint structure for x and z must be specified. Modeling normally distributed (x,z) as sums of between- and within-individual (or cluster) components allows us to incorporate different degrees of between and within-individual correlation. Such a structure includes, as special cases, cohort and period effects in longitudinal settings and random intercept models. The latter situation corresponds to allowing z to vary only on the between-individual (or cluster) level and to be uncorrelated with x, and leads to attenuation of the coefficient of x in marginal models with the legit and probit links. More complex situations occur when z is allowed to also vary on the within-individual (or cluster) level and when z is correlated with x. We examine the model specification and the expected bias when fitting a marginal model in the presence of the omitted confounder z. We derive general formulas and interpret the parameters and results in an ongoing cohort study. Testing for omitted I covariates is also discussed.
引用
收藏
页码:678 / 689
页数:12
相关论文
共 22 条
[1]   Historical change in the report of daytime fatigue [J].
Bliwise, DL .
SLEEP, 1996, 19 (06) :462-464
[2]   A METHOD FOR GENERATING HIGH-DIMENSIONAL MULTIVARIATE BINARY VARIATES [J].
EMRICH, LJ ;
PIEDMONTE, MR .
AMERICAN STATISTICIAN, 1991, 45 (04) :302-304
[3]   TESTS FOR NO TREATMENT EFFECT IN RANDOMIZED CLINICAL-TRIALS [J].
GAIL, MH ;
TAN, WY ;
PIANTADOSI, S .
BIOMETRIKA, 1988, 75 (01) :57-64
[4]  
GAIL MH, 1984, BIOMETRIKA, V71, P131
[6]  
JOHNSON NL, 1970, DISTRIBUTIONS STATIS, V2
[8]   LONGITUDINAL DATA-ANALYSIS USING GENERALIZED LINEAR-MODELS [J].
LIANG, KY ;
ZEGER, SL .
BIOMETRIKA, 1986, 73 (01) :13-22
[9]   ANALYSIS OF REPEATED CATEGORICAL-DATA USING GENERALIZED ESTIMATING EQUATIONS [J].
LIPSITZ, SR ;
KIM, K ;
ZHAO, LP .
STATISTICS IN MEDICINE, 1994, 13 (11) :1149-1163
[10]  
McCullagh P., 1989, GEN LINEAR MODELS, DOI [DOI 10.1007/978-1-4899-3242-6, 10.1201/9780203753736, DOI 10.2307/2347392]