Using published data in Mendelian randomization: a blueprint for efficient identification of causal risk factors

被引:1225
作者
Burgess, Stephen [1 ]
Scott, Robert A. [2 ]
Timpson, Nicholas J. [3 ]
Smith, George Davey [3 ]
Thompson, Simon G. [1 ]
机构
[1] Univ Cambridge, Dept Publ Hlth & Primary Care, Cambridge, England
[2] Univ Cambridge, MRC Epidemiol Unit, Cambridge, England
[3] Univ Bristol, MRC Integrat Epidemiol Unit, Bristol, Avon, England
基金
英国医学研究理事会; 英国惠康基金;
关键词
Mendelian randomization; Instrumental variable; Causal inference; Published data; Two-sample Mendelian randomization; Summarized data; CORONARY-HEART-DISEASE; INSTRUMENTAL VARIABLES; GENETIC-VARIANTS; ASSOCIATION; METAANALYSIS; DESIGN; COHORT; LOCI;
D O I
10.1007/s10654-015-0011-z
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Finding individual-level data for adequately-powered Mendelian randomization analyses may be problematic. As publicly-available summarized data on genetic associations with disease outcomes from large consortia are becoming more abundant, use of published data is an attractive analysis strategy for obtaining precise estimates of the causal effects of risk factors on outcomes. We detail the necessary steps for conducting Mendelian randomization investigations using published data, and present novel statistical methods for combining data on the associations of multiple (correlated or uncorrelated) genetic variants with the risk factor and outcome into a single causal effect estimate. A two-sample analysis strategy may be employed, in which evidence on the gene-risk factor and gene-outcome associations are taken from different data sources. These approaches allow the efficient identification of risk factors that are suitable targets for clinical intervention from published data, although the ability to assess the assumptions necessary for causal inference is diminished. Methods and guidance are illustrated using the example of the causal effect of serum calcium levels on fasting glucose concentrations. The estimated causal effect of a 1 standard deviation (0.13 mmol/L) increase in calcium levels on fasting glucose (mM) using a single lead variant from the CASR gene region is 0.044 (95 % credible interval -0.002, 0.100). In contrast, using our method to account for the correlation between variants, the corresponding estimate using 17 genetic variants is 0.022 (95 % credible interval 0.009, 0.035), a more clearly positive causal effect.
引用
收藏
页码:543 / 552
页数:10
相关论文
共 38 条
[1]  
[Anonymous], TECHNICAL REPORT
[2]  
BASMANN RL, 1960, J AM STAT ASSOC, V55, P650
[3]   Instrumental variables and GMM: Estimation and testing [J].
Baum, Christopher F. ;
Schaffer, Mark E. ;
Stillman, Steven .
STATA JOURNAL, 2003, 3 (01) :1-31
[4]  
Burgess S, 2015, AM J EPIDEMIOL, V181, P251, DOI 10.1093/aje/kwu283
[5]   Methods for meta-analysis of individual participant data from Mendelian randomisation studies with binary outcomes [J].
Burgess, Stephen ;
Thompson, Simon G. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (01) :272-293
[6]   Mendelian Randomization Analysis With Multiple Genetic Variants Using Summarized Data [J].
Burgess, Stephen ;
Butterworth, Adam ;
Thompson, Simon G. .
GENETIC EPIDEMIOLOGY, 2013, 37 (07) :658-665
[7]   Use of allele scores as instrumental variables for Mendelian randomization [J].
Burgess, Stephen ;
Thompson, Simon G. .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2013, 42 (04) :1134-1144
[8]   Use of Mendelian randomisation to assess potential benefit of clinical intervention [J].
Burgess, Stephen ;
Butterworth, Adam ;
Malarstig, Anders ;
Thompson, Simon G. .
BMJ-BRITISH MEDICAL JOURNAL, 2012, 345
[9]   Avoiding bias from weak instruments in Mendelian randomization studies [J].
Burgess, Stephen ;
Thompson, Simon G. .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2011, 40 (03) :755-764
[10]   Mendelian randomization as an instrumental variable approach to causal inference [J].
Didelez, Vanessa ;
Sheehan, Nuala .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2007, 16 (04) :309-330