Regression models for clustered binary responses: Implications of ignoring the intracluster correlation in an analysis of perinatal mortality in twin gestations

被引:63
作者
Ananth, CV
Platt, RW
Savitz, DA
机构
[1] Univ Med & Dent New Jersey, Robert Wood Johnson Med Sch, Dept Obstet Gynecol & Reprod Sci, Epidemiol & Biostat Sect, New Brunswick, NJ 08901 USA
[2] McGill Univ, Dept Pediat, Montreal, PQ H3A 2T5, Canada
[3] McGill Univ, Dept Epidemiol & Biostat, Montreal, PQ H3A 2T5, Canada
[4] Univ N Carolina, Sch Publ Hlth, Dept Epidemiol, Chapel Hill, NC USA
关键词
regression models; dependent responses; clustered data; longitudinal study; perinatal mortality; twin gestations;
D O I
10.1016/j.annepidem.2004.08.007
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
PURPOSE: Dependent binary responses, such as health outcomes in twin pairs or siblings, frequently arise in perinatal epidemiologic research. This gives rise to correlated data, which must be taken into account during analysis to avoid erroneous statistical and biological inferences. METHODS: An analysis of perinatal mortality (fetal deaths plus deaths within the first 28 days) in twins in relation to cluster-varying (those that are unique to each fetus within a twin pregnancy such as birthweight) and cluster-constant (those that are identical for both twins within a sibship such as maternal smoking status) risk factors is presented. Marginal (ordinary logistic regression [OLR] and logistic regression using generalized estimating equations [GEE]) and cluster-specific (conditional and random-intercept logistic regression models) regression models are fit and their results contrasted. The United States "matched multiple data" file of twin births (1995-1997), which includes 285,226 twins from 142,613 pregnancies, was used to examine the implications of ignoring of clustering on regression inferences. RESULTS: The OLR models provide variance estimates for cluster constant covariates that ranged from 7% to 71% smaller than those from GEE-based models. This underestimation is even more pronounced for some cluster-varying covariates, ranging from 21% to 198%. CONCLUSIONS: Ignoring the cluster dependency is likely to affect the precision of covariate effects and consequently interpretation of results. With widespread availability of appropriate software, statistical methods for taking the intracluster dependency into account are easily implemented and necessary. (c) 2004 Elsevier Inc. All rights reserved.
引用
收藏
页码:293 / 301
页数:9
相关论文
共 49 条
[1]  
[Anonymous], 2002, ANAL LONGITUDINAL DA
[2]   Twin pregnancy outcome and chorionicity [J].
Baghdadi, S ;
Gee, H ;
Whittle, MJ ;
Khan, KS .
ACTA OBSTETRICIA ET GYNECOLOGICA SCANDINAVICA, 2003, 82 (01) :18-21
[3]  
Breslow N., 2003, 192 U WASH
[4]  
Chhabra S, 2002, J Obstet Gynaecol, V22, P39, DOI 10.1080/01443610120101691
[5]  
CROWDER M, 1995, BIOMETRIKA, V82, P407
[6]   Does chorionicity or zygosity predict adverse perinatal outcomes in twins? [J].
Dubé, J ;
Dodds, L ;
Armson, BA .
AMERICAN JOURNAL OF OBSTETRICS AND GYNECOLOGY, 2002, 186 (03) :579-583
[7]  
Gelman A, 2013, BAYESIAN DATA ANAL, DOI DOI 10.1201/9780429258411
[8]   Fetal or infant death in twin pregnancy: neurodevelopmental consequence for the survivor [J].
Glinianaia, SV ;
Pharoah, POD ;
Wright, C ;
Rankin, JM .
ARCHIVES OF DISEASE IN CHILDHOOD-FETAL AND NEONATAL EDITION, 2002, 86 (01) :F9-F15
[9]   Statistical analysis of correlated data using generalized estimating equations: An orientation [J].
Hanley, JA ;
Negassa, A ;
Edwardes, MDD ;
Forrester, JE .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2003, 157 (04) :364-375
[10]  
Hanley JA, 2000, STAT MED, V19, P715, DOI 10.1002/(SICI)1097-0258(20000315)19:5<715::AID-SIM342>3.0.CO