Power and instrument strength requirements for Mendelian randomization studies using multiple genetic variants

被引:1271
作者
Pierce, Brandon L. [1 ]
Ahsan, Habibul
VanderWeele, Tyler J. [2 ,3 ]
机构
[1] Univ Chicago, Ctr Canc Epidemiol & Prevent, Dept Hlth Studies, Chicago, IL 60637 USA
[2] Harvard Univ, Sch Publ Hlth, Dept Epidemiol, Boston, MA 02115 USA
[3] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
基金
美国国家卫生研究院;
关键词
Mendelian randomization; instrumental variable analysis; power; weak instrument; causal inference; two-stage least squares regression; GENOME-WIDE ASSOCIATION; C-REACTIVE PROTEIN; METABOLIC-SYNDROME; VARIABLE ANALYSIS; CAUSAL INFERENCE; COMMON VARIANTS; SERUM URATE; LOCI; RISK; AGE;
D O I
10.1093/ije/dyq151
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background Mendelian Randomization (MR) studies assess the causality of an exposure-disease association using genetic determinants [i.e. instrumental variables (IVs)] of the exposure. Power and IV strength requirements for MR studies using multiple genetic variants have not been explored. Methods We simulated cohort data sets consisting of a normally distributed disease trait, a normally distributed exposure, which affects this trait and a biallelic genetic variant that affects the exposure. We estimated power to detect an effect of exposure on disease for varying allele frequencies, effect sizes and samples sizes (using two-stage least squares regression on 10 000 data sets-Stage 1 is a regression of exposure on the variant. Stage 2 is a regression of disease on the fitted exposure). Similar analyses were conducted using multiple genetic variants (5, 10, 20) as independent or combined IVs. We assessed IV strength using the first-stage F statistic. Results Simulations of realistic scenarios indicate that MR studies will require large (n > 1000), often very large (n > 10 000), sample sizes. In many cases, so-called 'weak IV' problems arise when using multiple variants as independent IVs (even with as few as five), resulting in biased effect estimates. Combining genetic factors into fewer IVs results in modest power decreases, but alleviates weak IV problems. Ideal methods for combining genetic factors depend upon knowledge of the genetic architecture underlying the exposure. Conclusions The feasibility of well-powered, unbiased MR studies will depend upon the amount of variance in the exposure that can be explained by known genetic factors and the 'strength' of the IV set derived from these genetic factors.
引用
收藏
页码:740 / 752
页数:13
相关论文
共 63 条
[1]   2-STAGE LEAST-SQUARES ESTIMATION OF AVERAGE CAUSAL EFFECTS IN MODELS WITH VARIABLE TREATMENT INTENSITY [J].
ANGRIST, JD ;
IMBENS, GW .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (430) :431-442
[2]  
[Anonymous], 1998, Applied Regression Analysis
[3]  
[Anonymous], 2008, The International Journal of Biostatistics, DOI DOI 10.2202/1557-4679.1114
[4]   Loci influencing lipid levels and coronary heart disease risk in 16 European population cohorts [J].
Aulchenko, Yurii S. ;
Ripatti, Samuli ;
Lindqvist, Ida ;
Boomsma, Dorret ;
Heid, Iris M. ;
Pramstaller, Peter P. ;
Penninx, Brenda W. J. H. ;
Janssens, A. Cecile J. W. ;
Wilson, James F. ;
Spector, Tim ;
Martin, Nicholas G. ;
Pedersen, Nancy L. ;
Kyvik, Kirsten Ohm ;
Kaprio, Jaakko ;
Hofman, Albert ;
Freimer, Nelson B. ;
Jarvelin, Marjo-Riitta ;
Gyllensten, Ulf ;
Campbell, Harry ;
Rudan, Igor ;
Johansson, Asa ;
Marroni, Fabio ;
Hayward, Caroline ;
Vitart, Veronique ;
Jonasson, Inger ;
Pattaro, Cristian ;
Wright, Alan ;
Hastie, Nick ;
Pichler, Irene ;
Hicks, Andrew A. ;
Falchi, Mario ;
Willemsen, Gonneke ;
Hottenga, Jouke-Jan ;
de Geus, Eco J. C. ;
Montgomery, Grant W. ;
Whitfield, John ;
Magnusson, Patrik ;
Saharinen, Juha ;
Perola, Markus ;
Silander, Kaisa ;
Isaacs, Aaron ;
Sijbrands, Eric J. G. ;
Uitterlinden, Andre G. ;
Witteman, Jacqueline C. M. ;
Oostra, Ben A. ;
Elliott, Paul ;
Ruokonen, Aimo ;
Sabatti, Chiara ;
Gieger, Christian ;
Meitinger, Thomas .
NATURE GENETICS, 2009, 41 (01) :47-55
[5]   Instrumental variables and GMM: Estimation and testing [J].
Baum, Christopher F. ;
Schaffer, Mark E. ;
Stillman, Steven .
STATA JOURNAL, 2003, 3 (01) :1-31
[6]   A polymorphism within the G6PC2 gene is associated with fasting plasma glucose levels [J].
Bouatia-Naji, Nabila ;
Rocheleau, Ghislain ;
Van Lommel, Leentje ;
Lemaire, Katleen ;
Schuit, Frans ;
Cavalcanti-Proenca, Christine ;
Marchand, Marion ;
Hartikainen, Anna-Liisa ;
Sovio, Ulla ;
De Graeve, Franck ;
Rung, Johan ;
Vaxillaire, Martine ;
Tichet, Jean ;
Marre, Michel ;
Balkau, Beverley ;
Weill, Jacques ;
Elliott, Paul ;
Jarvelin, Marjo-Riitta ;
Meyre, David ;
Polychronakos, Constantin ;
Dina, Christian ;
Sladek, Robert ;
Froguel, Philippe .
SCIENCE, 2008, 320 (5879) :1085-1088
[7]   A variant near MTNR1B is associated with increased fasting plasma glucose levels and type 2 diabetes risk [J].
Bouatia-Naji, Nabila ;
Bonnefond, Amelie ;
Cavalcanti-Proenca, Christine ;
Sparso, Thomas ;
Holmkvist, Johan ;
Marchand, Marion ;
Delplanque, Jerome ;
Lobbens, Stephane ;
Rocheleau, Ghislain ;
Durand, Emmanuelle ;
De Graeve, Franck ;
Chevre, Jean-Claude ;
Borch-Johnsen, Knut ;
Hartikainen, Anna-Liisa ;
Ruokonen, Aimo ;
Tichet, Jean ;
Marre, Michel ;
Weill, Jacques ;
Heude, Barbara ;
Tauber, Maithe ;
Lemaire, Katleen ;
Schuit, Frans ;
Elliott, Paul ;
Jorgensen, Torben ;
Charpentier, Guillaume ;
Hadjadj, Samy ;
Cauchi, Stephane ;
Vaxillaire, Martine ;
Sladek, Robert ;
Visvikis-Siest, Sophie ;
Balkau, Beverley ;
Levy-Marchal, Claire ;
Pattou, Francois ;
Meyre, David ;
Blakemore, Alexandra I. F. ;
Jarvelin, Marjo-Riita ;
Walley, Andrew J. ;
Hansen, Torben ;
Dina, Christian ;
Pedersen, Oluf ;
Froguel, Philippe .
NATURE GENETICS, 2009, 41 (01) :89-94
[8]   PROBLEMS WITH INSTRUMENTAL VARIABLES ESTIMATION WHEN THE CORRELATION BETWEEN THE INSTRUMENTS AND THE ENDOGENOUS EXPLANATORY VARIABLE IS WEAK [J].
BOUND, J ;
JAEGER, DA ;
BAKER, RM .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (430) :443-450
[9]   Inflammation, insulin resistance, and diabetes-mendelian randomization using CRP haplotypes points upstream [J].
Brunner, Eric J. ;
Kivimaeki, Mika ;
Witte, Daniel R. ;
Lawlor, Debbie A. ;
Smith, George Davey ;
Cooper, Jackie A. ;
Miller, Michelle ;
Lowe, Gordon D. O. ;
Rumley, Ann ;
Casas, Juan P. ;
Shah, Tina ;
Humphries, Steve E. ;
Hingorani, Aroon D. ;
Marmot, Michael G. ;
Timpson, Nicholas J. ;
Kumari, Meena .
PLOS MEDICINE, 2008, 5 (08) :1278-1286
[10]   Variations in the G6PC2/ABCB11 genomic region are associated with fasting glucose levels [J].
Chen, Wei-Min ;
Erdos, Michael R. ;
Jackson, Anne U. ;
Saxena, Richa ;
Sanna, Serena ;
Silver, Kristi D. ;
Timpson, Nicholas J. ;
Hansen, Torben ;
Orru, Marco ;
Piras, Maria Grazia ;
Bonnycastle, Lori L. ;
Willer, Cristen J. ;
Lyssenko, Valeriya ;
Shen, Haiqing ;
Kuusisto, Johanna ;
Ebrahim, Shah ;
Sestu, Natascia ;
Duren, William L. ;
Spada, Maria Cristina ;
Stringham, Heather M. ;
Scott, Laura J. ;
Olla, Nazario ;
Swift, Amy J. ;
Najjar, Samer ;
Mitchell, Braxton D. ;
Lawlor, Debbie A. ;
Smith, George Davey ;
Ben-Shlomo, Yoav ;
Andersen, Gitte ;
Borch-Johnsen, Knut ;
Jorgensen, Torben ;
Saramies, Jouko ;
Valle, Timo T. ;
Buchanan, Thomas A. ;
Shuldiner, Alan R. ;
Lakatta, Edward ;
Bergman, Richard N. ;
Uda, Manuela ;
Tuomilehto, Jaakko ;
Pedersen, Oluf ;
Cao, Antonio ;
Groop, Leif ;
Mohlke, Karen L. ;
Laakso, Markku ;
Schlessinger, David ;
Collins, Francis S. ;
Altshuler, David ;
Abecasis, Goncalo R. ;
Boehnke, Michael ;
Scuteri, Angelo .
JOURNAL OF CLINICAL INVESTIGATION, 2008, 118 (07) :2620-2628