Information-theoretic metrics for visualizing gene-environment interactions

被引:55
作者
Chanda, Pritam
Zhang, Aidong
Brazeau, Daniel
Sucheston, Lara
Freudenheim, Jo L.
Ambrosone, Christine
Ramanathan, Murali
机构
[1] SUNY Buffalo, Dept Pharmaceut Sci, Buffalo, NY 14260 USA
[2] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
[3] SUNY Buffalo, Dept Biostat, Buffalo, NY 14260 USA
[4] SUNY Buffalo, Dept Social & Prevent Med, Buffalo, NY 14260 USA
[5] Roswell Pk Canc Inst, Dept Canc Prevent & Control, Buffalo, NY 14263 USA
关键词
D O I
10.1086/521878
中图分类号
Q3 [遗传学];
学科分类号
071007 [遗传学]; 090102 [作物遗传育种];
摘要
The purpose of our work was to develop heuristics for visualizing and interpreting gene-environment interactions (GEIs) and to assess the dependence of candidate visualization metrics on biological and study-design factors. Two information-theoretic metrics, the k-way interaction information (KWII) and the total correlation information (TCI), were investigated. The effectiveness of the KWII and TCI to detect GEIs in a diverse range of simulated data sets and a Crohn disease data set was assessed. The sensitivity of the KWII and TCI spectra to biological and study-design variables was determined. Head-to-head comparisons with the relevance-chain, multifactor dimensionality reduction, and the pedigree disequilibrium test (PDT) methods were obtained. The KWII and TCI spectra, which are graphical summaries of the KWII and TCI for each subset of environmental and genotype variables, were found to detect each known GEI in the simulated data sets. The patterns in the KWII and TCI spectra were informative for factors such as case-control misassignment, locus heterogeneity, allele frequencies, and linkage disequilibrium. The KWII and TCI spectra were found to have excellent sensitivity for identifying the key disease-associated genetic variations in the Crohn disease data set. In head-to-head comparisons with the relevance-chain, multifactor dimensionality reduction, and PDT methods, the results from visual interpretation of the KWII and TCI spectra performed satisfactorily. The KWII and TCI are promising metrics for visualizing GEIs. They are capable of detecting interactions among numerous single-nucleotide polymorphisms and environmental variables for a diverse range of GEI models.
引用
收藏
页码:939 / 963
页数:25
相关论文
共 37 条
[1]
A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[2]
Ambrosone CB, 2007, JNCI-J NATL CANCER I, V99, P487, DOI 10.1093/jnci/djk097
[3]
Concordance of multiple analytical approaches demonstrates a complex relationship between DNA repair gene SNPs, smoking and bladder cancer susceptibility [J].
Andrew, AS ;
Nelson, HH ;
Kelsey, KT ;
Moore, JH ;
Meng, AC ;
Casella, DP ;
Tosteson, TD ;
Schned, AR ;
Karagas, MR .
CARCINOGENESIS, 2006, 27 (05) :1030-1037
[4]
[Anonymous], 2005, Machine learning based on attribute interactions
[5]
BELL AJ, 2003, 4 INT S IND COMP AN
[6]
CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[7]
Information-theoretic identification of predictive SNPs and supervised visualization of genome-wide association studies [J].
Bhasi, Kavitha ;
Zhang, Li ;
Brazeau, Daniel ;
Zhang, Aidong ;
Ramanathan, Murali .
NUCLEIC ACIDS RESEARCH, 2006, 34 (14)
[8]
VizStruct for visualization of genome-wide SNP analyses [J].
Bhasi, Kavitha ;
Zhang, Li ;
Brazeau, Daniel ;
Zhang, Aidong ;
Ramanathan, Murali .
BIOINFORMATICS, 2006, 22 (13) :1569-1576
[9]
Parallel multifactor dimensionality reduction: a tool for the large-scale analysis of gene-gene interactions [J].
Bush, William S. ;
Dudek, Scott M. ;
Ritchie, Marylyn D. .
BIOINFORMATICS, 2006, 22 (17) :2173-2174
[10]
Multifactor-dimensionality reduction shows a two-locus interaction associated with Type 2 diabetes mellitus [J].
Cho, YM ;
Ritchie, MD ;
Moore, JH ;
Park, JY ;
Lee, KU ;
Shin, HD ;
Lee, HK ;
Park, KS .
DIABETOLOGIA, 2004, 47 (03) :549-554