Gene-centric genomewide association study via entropy

被引:33
作者
Cui, Yuehua [1 ]
Kang, Guolian [1 ]
Sun, Kelian [2 ]
Qian, Minping [2 ]
Romero, Roberto [3 ]
Fu, Wenjiang [2 ]
机构
[1] Michigan State Univ, Dept Stat & Probabil, E Lansing, MI 48824 USA
[2] Michigan State Univ, Dept Epidemiol, E Lansing, MI 48824 USA
[3] NICHHD, Perinatol Res Branch, NIH, Bethesda, MD USA
关键词
D O I
10.1534/genetics.107.082370
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genes are the functional units in most organisms. Compared to genetic variants located outside genes, genic variants are more likely to affect disease risk. The development of tire human HapMap project provides an unprecedented opportunity for genetic association studies at the genomewide level for elucidating disease etiology. Currently, most association studies at the single-nucleotide polymorphism (SNP) or the haplotype level rely on the linkage information between SNP markers and disease variants, with which association findings are difficult to replicate. Moreover, variants in genes might not be sufficiently covered by currently available methods. In this article, we present a gene-centric approach via entropy statistics for a genomewide association study to identify disease genes. The new entropy-based approach considers genic variants within one gene simultaneous]), and is developed on the basis of a joint genotype distribution among genetic variants for an association test. A grouping algorithm based on a penalized entropy measure is proposed to reduce the dimension of the test statistic. Type I error rates and power of the entropy test are evaluated through extensive Simulation studies. The results indicate that the entropy test has stable power under different disease models with a reasonable sample size. Compared to single SNP-based analysis, the gene-centric approach has greater power, especially when there is more than one disease variant in a gene. As the genomewide genic SNPs become available, our entropy-based gene-centric approach would provide a robust and computationally efficient way for gene-based genomewide association study.
引用
收藏
页码:637 / 650
页数:14
相关论文
共 41 条
[1]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[2]   Vascular endothelial growth factor, epidermal growth factor and fibroblast growth factor-4 and-10 stimulate trophoblast plasminogen activator system and metalloproteinase-9 [J].
Anteby, EY ;
Greenfield, C ;
Natanson-Yaron, S ;
Goldman-Wohl, D ;
Hamani, Y ;
Khudyak, V ;
Ariel, I ;
Yagel, S .
MOLECULAR HUMAN REPRODUCTION, 2004, 10 (04) :229-235
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]  
BOEHNKE M, 1994, AM J HUM GENET, V55, P379
[5]   Candidate gene analysis suggests a role for fatty acid biosynthesis and regulation of the complement system in the etiology of age-related maculopathy [J].
Conley, YP ;
Thalamuthu, A ;
Jakobsdottir, J ;
Weeks, DE ;
Mah, T ;
Ferrell, RE ;
Gorin, MB .
HUMAN MOLECULAR GENETICS, 2005, 14 (14) :1991-2002
[6]  
Cover TM., 2006, Elements of information theory, DOI [10.1002/047174882X.ch2,arXiv:https://onlinelibrary.wiley.com/doi/pdf/10.1002/047174882X.ch2, DOI 10.1002/047174882X]
[7]   Linkage disequilibrium mapping via cladistic analysis of single-nucleotide polymorphism haplotypes [J].
Durrant, C ;
Zondervan, KT ;
Cardon, LR ;
Hunt, S ;
Deloukas, P ;
Morris, AP .
AMERICAN JOURNAL OF HUMAN GENETICS, 2004, 75 (01) :35-43
[8]   Paternal and maternal components of the predisposition to preeclampsia. [J].
Esplin, MS ;
Fausett, MB ;
Fraser, A ;
Kerber, R ;
Mineau, G ;
Carrillo, J ;
Varner, MW .
NEW ENGLAND JOURNAL OF MEDICINE, 2001, 344 (12) :867-872
[9]  
Fan JQ, 1996, J AM STAT ASSOC, V91, P674
[10]   Assessing the impact of population stratification on genetic association studies [J].
Freedman, ML ;
Reich, D ;
Penney, KL ;
McDonald, GJ ;
Mignault, AA ;
Patterson, N ;
Gabriel, SB ;
Topol, EJ ;
Smoller, JW ;
Pato, CN ;
Pato, MT ;
Petryshen, TYL ;
Kolonel, LN ;
Lander, ES ;
Sklar, P ;
Henderson, B ;
Hirschhorn, JN ;
Altshuler, D .
NATURE GENETICS, 2004, 36 (04) :388-393