Regression models for twin studies: a critical review

被引:385
作者
Carlin, JB
Gurrin, LC
Sterne, JAC
Morley, R
Dwyer, T
机构
[1] Royal Childrens Hosp, Murdoch Childrens Res Inst, Clin Epidemiol & Biostat Unit, Melbourne, Vic, Australia
[2] Univ Melbourne, Sch Populat Hlth, Epiemiol & Biostat Unit, Melbourne, Vic, Australia
[3] Univ Bristol, Dept Social Med, Bristol, Avon, England
[4] Univ Melbourne, Dept Paediat, Melbourne, Vic, Australia
关键词
statistics; twin studies; regression models; data correlation;
D O I
10.1093/ije/dyi153
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Twin studies have long been recognized for their value in learning about the aetiology of disease and specifically for their potential for separating genetic effects from environmental effects. The recent upsurge of interest in life-course epidemiology and the study of developmental influences on later health has provided a new impetus to study twins as a source of unique insights. Twins are of special interest because they provide naturally matched pairs where the confounding effects of a large number of potentially causal factors (such as maternal nutrition or gestation length) may be removed by comparisons between twins who share them. The traditional tool of epidemiological 'risk factor analysis' is the regression model, but it is not straightforward to transfer standard regression methods to twin data, because the analysis needs to reflect the paired structure of the data, which induces correlation between twins. This paper reviews the use of more specialized regression methods for twin data, based on generalized least squares or linear mixed models, and explains the relationship between these methods and the commonly used approach of analysing within-twin-pair difference values. Methods and issues of interpretation are illustrated using an example from a recent study of the association between birth weight and cord blood erythropoietin. We focus on the analysis of continuous outcome measures but review additional complexities that arise with binary outcomes. We recommend the use of a general model that includes separate regression coefficients for within-twin-pair and between-pair effects, and provide guidelines for the interpretation of estimates obtained under this model.
引用
收藏
页码:1089 / 1099
页数:11
相关论文
共 32 条
[1]   Separation of individual-level and cluster-level covariate effects in regression analysis of correlated data [J].
Begg, MD ;
Parides, MK .
STATISTICS IN MEDICINE, 2003, 22 (16) :2591-2602
[2]  
Berk RA., 2004, Regression analysis: a constructive critique
[3]  
Carlin J B, 2001, Biostatistics, V2, P397, DOI 10.1093/biostatistics/2.4.397
[4]  
Carlin JB, 1999, STAT MED, V18, P2655, DOI 10.1002/(SICI)1097-0258(19991015)18:19<2655::AID-SIM202>3.3.CO
[5]  
2-R
[6]   Do genetic factors contribute to the association between birth weight and blood pressure? [J].
Christensen, K ;
Stovring, H ;
McGue, M .
JOURNAL OF EPIDEMIOLOGY AND COMMUNITY HEALTH, 2001, 55 (08) :583-587
[7]   Modeling postnatal exposures and their interactions with birth size [J].
Cole, TJ .
JOURNAL OF NUTRITION, 2004, 134 (01) :201-204
[8]   Within pair association between birth weight and blood pressure at age 8 in twins for a cohort study [J].
Dwyer, T ;
Blizzard, L ;
Morley, R ;
Ponsonby, AL .
BMJ-BRITISH MEDICAL JOURNAL, 1999, 319 (7221) :1325-1329
[9]   Twins and fetal origins hypothesis: within-pair analyses [J].
Dwyer, T ;
Morley, R ;
Blizzard, L .
LANCET, 2002, 359 (9324) :2205-2206
[10]   An overview of relations among causal modelling methods [J].
Greenland, S ;
Brumback, B .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2002, 31 (05) :1030-1037