Power of multifactor dimensionality reduction for detecting gene-gene interactions in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity

被引:441
作者
Ritchie, MD [1 ]
Hahn, LW [1 ]
Moore, JH [1 ]
机构
[1] Vanderbilt Univ, Sch Med, Program Human Genet, Dept Mol Physiol & Biophys, Nashville, TN 37232 USA
关键词
epistasis; multilocus; methods; simulation; complex diseases;
D O I
10.1002/gepi.10218
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The identification and characterization of genes that influence the risk of common, complex multifactorial diseases, primarily through interactions with other genes and other environmental factors, remains a statistical and computational challenge in genetic epidemiology. This challenge is partly due to the limitations of parametric statistical methods for detecting genetic effects that are dependent solely or partially on interactions with other genes and environmental exposures. We previously introduced multifactor dimensionality reduction (MDR) as a method for reducing the dimensionality of multilocus genotype information to improve the identification of polymorphism combinations associated with disease risk. The MDR approach is nonparametric (i.e., no hypothesis about the value of a statistical parameter is made), is model-free (i.e., assumes no particular inheritance model), and is directly applicable to case-control and discordant sib-pair study designs. Both empirical and theoretical studies suggest that MDR has excellent power for identifying high-order gene-gene interactions. However, the power of MDR for identifying gene-gene interactions in the presence of common sources of noise is not currently known. The goal of this study was to evaluate the power of MDR for identifying gene-gene interactions in the presence of noise due to genotyping error, missing data, phenocopy, and genetic or locus heterogeneity. Using simulated data, we show that MDR has high power to identify gene-gene interactions in the presence of 5% genotyping error, 5% missing data, or a combination of both. However, MDR has reduced power for some models in the presence of 50% phenocopy, and very limited power in the presence of 50% genetic heterogeneity. Extending MDR to address genetic heterogeneity should be a priority for the continued methodological development of this new approach. (C) 2003 Wiley-Liss, Inc.
引用
收藏
页码:150 / 157
页数:8
相关论文
共 19 条
[1]   The effect that genotyping errors have on the robustness of common linkage-disequilibrium measures [J].
Akey, JM ;
Zhang, K ;
Xiong, MM ;
Doris, P ;
Jin, L .
AMERICAN JOURNAL OF HUMAN GENETICS, 2001, 68 (06) :1447-1456
[2]  
Anderson J., 1995, INTRO NEURAL NETWORK, DOI DOI 10.7551/MITPRESS/3905.001.0001
[3]  
[Anonymous], 1961, Adaptive Control Processes: a Guided Tour, DOI DOI 10.1515/9781400874668
[4]  
[Anonymous], P GEN EV ALG C
[5]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946
[6]   Genetic determinants of type 2 diabetes mellitus [J].
Busch, CP ;
Hegele, RA .
CLINICAL GENETICS, 2001, 60 (04) :243-254
[7]   THE RISK OF DETERMINING RISK WITH MULTIVARIABLE MODELS [J].
CONCATO, J ;
FEINSTEIN, AR ;
HOLFORD, TR .
ANNALS OF INTERNAL MEDICINE, 1993, 118 (03) :201-210
[8]   Who's afraid of epistasis? [J].
Frankel, WN ;
Schork, NJ .
NATURE GENETICS, 1996, 14 (04) :371-373
[9]  
HAHN LW, 2003, IN PRESS BIOINFORMAT
[10]  
Hosmer W., 2000, Applied Logistic Regression, VSecond