Entropy-based joint analysis for two-stage genome-wide association studies

Cited by: 10
Authors
Kang, Guolian [1]
Zuo, Yijun [1]
Affiliations
[1] Michigan State Univ, Dept Stat & Probabil, E Lansing, MI 48824 USA
Keywords
complex diseases; entropy; false discovery rate; genetic variants;
DOI
10.1007/s10038-007-0177-7
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
Genome-wide association studies (GWAS) are now being conducted to identify common genetic variants that predispose to human diseases and thereby unravel the genetic etiology of complex human diseases. Because of genotyping cost constraints, a GWAS often follows a two-stage design, in which a large number of markers are genotyped in a proportion of the available samples in stage 1, and the markers identified in stage 1 are then examined in all the samples in stage 2. In this paper, we introduce a nonlinear entropy-based statistic for joint analysis of two-stage genome-wide association studies. Type I error rates and power of the entropy-based statistic for association tests are validated by simulation studies in the single-locus setting, and the power of entropy-based joint analysis is likewise investigated by simulation. The results suggest that, when detecting rare genetic variants, entropy-based joint analysis is always more powerful than linear joint analysis, which uses a linear function of risk allele frequencies in cases and controls; when detecting common genetic variants, the powers of the two joint analyses are comparable. Furthermore, when the false discovery rate is controlled, entropy-based joint analysis is more powerful and needs fewer samples than linear joint analysis. We therefore recommend the entropy-based strategy for two-stage genome-wide association studies that aim to detect rare and common genetic variants with moderate to large genetic effects underlying a complex disease.
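The contrast between a linear and an entropy-based single-locus statistic can be illustrated with a minimal sketch. This record does not give the paper's formulas, so everything below is an assumption made for illustration: the linear statistic is taken to be the standardized difference of risk allele frequencies between cases and controls, the entropy-based statistic applies the transform g(p) = -p log p to each frequency and standardizes the difference with a delta-method variance, and all function and parameter names are hypothetical.

```python
import math


def allele_freq_variance(p, n_individuals):
    """Binomial variance of an allele frequency estimated from 2n chromosomes."""
    return p * (1.0 - p) / (2.0 * n_individuals)


def check_freq(p):
    """Both statistics below need a frequency strictly between 0 and 1."""
    if not 0.0 < p < 1.0:
        raise ValueError("allele frequency must lie strictly between 0 and 1")


def linear_z(p_case, p_ctrl, n_case, n_ctrl):
    """Assumed linear statistic: standardized difference of allele frequencies."""
    check_freq(p_case)
    check_freq(p_ctrl)
    var = allele_freq_variance(p_case, n_case) + allele_freq_variance(p_ctrl, n_ctrl)
    return (p_case - p_ctrl) / math.sqrt(var)


def entropy_term(p):
    """Assumed nonlinear transform g(p) = -p * log(p)."""
    return -p * math.log(p)


def entropy_z(p_case, p_ctrl, n_case, n_ctrl):
    """Assumed entropy-based statistic with a delta-method variance.

    Uses g'(p) = -(1 + log p), so Var(g(p_hat)) is approximated by
    g'(p)^2 * Var(p_hat). The approximation degenerates when both
    frequencies equal 1/e, where g'(p) = 0.
    """
    check_freq(p_case)
    check_freq(p_ctrl)
    d_case = -(1.0 + math.log(p_case))
    d_ctrl = -(1.0 + math.log(p_ctrl))
    var = (d_case ** 2) * allele_freq_variance(p_case, n_case) \
        + (d_ctrl ** 2) * allele_freq_variance(p_ctrl, n_ctrl)
    if var == 0.0:
        raise ValueError("delta-method variance is zero (frequencies at 1/e)")
    return (entropy_term(p_case) - entropy_term(p_ctrl)) / math.sqrt(var)


if __name__ == "__main__":
    # Hypothetical rare variant: 3% risk allele in cases, 1% in controls.
    print(linear_z(0.03, 0.01, 1000, 1000))
    print(entropy_z(0.03, 0.01, 1000, 1000))
```

Under these assumptions the two functions can be compared at chosen allele frequencies and sample sizes; such a comparison only illustrates the idea of a nonlinear transform and does not reproduce the simulation results reported in the abstract.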
Pages: 747-756
Number of pages: 10