An entropy-based statistic for genomewide association studies

Cited by: 39
Authors
Zhao, JY
Boerwinkle, E
Xiong, MM
Affiliations
[1] Univ Texas, Ctr Human Genet, Ctr Hlth Sci, Houston, TX 77225 USA
[2] Fudan Univ, Lab Theoret Syst Biol, Sch Life Sci, Shanghai 200433, Peoples R China
DOI
10.1086/431243
Chinese Library Classification number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Efficient genotyping methods and the availability of a large collection of single-nucleotide polymorphisms provide valuable tools for genetic studies of human disease. The standard χ² statistic for case-control studies, which uses a linear function of allele frequencies, has limited power when the number of marker loci is large. We introduce a novel test statistic for genetic association studies that uses Shannon entropy and a nonlinear function of allele frequencies to amplify the differences in allele and haplotype frequencies, thereby maintaining statistical power with large numbers of marker loci. We investigate the relationship between the entropy-based test statistic and the standard χ² statistic and show that, in most cases, the power of the entropy-based statistic is greater than that of the standard χ² statistic. The distribution of the entropy-based statistic and the type I error rates are validated using simulation studies. Finally, we apply the new entropy-based test statistic to two real data sets, one for the COMT gene and schizophrenia and one for the MMP-2 gene and esophageal carcinoma, to evaluate the performance of the new method for genetic association studies. The results show that the entropy-based statistic obtained smaller P values than did the standard χ² statistic.
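The two ingredients the abstract contrasts can be illustrated with a minimal sketch. This is not the paper's test statistic, whose exact form the abstract does not give; it only shows the standard Pearson χ² on a 2 × k table of haplotype (or allele) counts alongside the per-haplotype Shannon entropy terms −p log p, which are a nonlinear function of the frequencies. The function names and the example counts are hypothetical.

```python
import math

def shannon_entropy(freqs):
    """Shannon entropy -sum(p * log p) in nats; zero frequencies contribute 0."""
    return -sum(p * math.log(p) for p in freqs if p > 0)

def entropy_terms(freqs):
    """Per-haplotype entropy contributions -p * log p (nonlinear in p)."""
    return [-p * math.log(p) if p > 0 else 0.0 for p in freqs]

def pearson_chi2(case_counts, control_counts):
    """Standard Pearson chi-square for a 2 x k table of counts."""
    n_case = sum(case_counts)
    n_ctrl = sum(control_counts)
    total = n_case + n_ctrl
    stat = 0.0
    for a, b in zip(case_counts, control_counts):
        col = a + b
        if col == 0:
            continue  # a haplotype absent from both groups carries no information
        for obs, row in ((a, n_case), (b, n_ctrl)):
            exp = row * col / total
            stat += (obs - exp) ** 2 / exp
    return stat

# Hypothetical counts for two haplotypes in cases and controls.
case_counts = [60, 40]
control_counts = [40, 60]

case_freqs = [c / sum(case_counts) for c in case_counts]
control_freqs = [c / sum(control_counts) for c in control_counts]

print("chi-square:", pearson_chi2(case_counts, control_counts))
print("entropy terms (cases):", entropy_terms(case_freqs))
print("entropy terms (controls):", entropy_terms(control_freqs))
```

A full entropy-based test would additionally need the covariance of the entropy terms to form a properly scaled statistic; that step is left out here because the abstract does not specify it.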
Pages: 27-40
Page count: 14