Comparison of Statistical Tests for Disease Association With Rare Variants

被引:182
作者
Basu, Saonli [1 ]
Pan, Wei [1 ]
机构
[1] Univ Minnesota, Sch Publ Hlth, Div Biostat, Minneapolis, MN 55455 USA
关键词
C-alpha test; kernel machine regression; logistic regression; model selection; permutation; pooled association tests; random-effects models; SSU test; Sum test; statistical power; LINKAGE DISEQUILIBRIUM; GENETIC-ASSOCIATION; COMMON DISEASES; GENOME ASSOCIATION; COMPLEX DISEASES; MULTIPLE SNPS; REGRESSION; SIMILARITY; TRAITS; SUSCEPTIBILITY;
D O I
10.1002/gepi.20609
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
In anticipation of the availability of next-generation sequencing data, there is increasing interest in investigating association between complex traits and rare variants (RVs). In contrast to association studies for common variants (CVs), due to the low frequencies of RVs, common wisdom suggests that existing statistical tests for CVs might not work, motivating the recent development of several new tests for analyzing RVs, most of which are based on the idea of pooling/collapsing RVs. However, there is a lack of evaluations of, and thus guidance on the use of, existing tests. Here we provide a comprehensive comparison of various statistical tests using simulated data. We consider both independent and correlated rare mutations, and representative tests for both CVs and RVs. As expected, if there are no or few non-causal (i.e. neutral or non-associated) RVs in a locus of interest while the effects of causal RVs on the trait are all (or mostly) in the same direction (i.e. either protective or deleterious, but not both), then the simple pooled association tests (without selecting RVs and their association directions) and a new test called kernel-based adaptive clustering (KBAC) perform similarly and are most powerful; KBAC is more robust than simple pooled association tests in the presence of non-causal RVs; however, as the number of non-causal CVs increases and/or in the presence of opposite association directions, the winners are two methods originally proposed for CVs and a new test called C-alpha test proposed for RVs, each of which can be regarded as testing on a variance component in a random-effects model. Interestingly, several methods based on sequential model selection (i.e. selecting causal RVs and their association directions), including two new methods proposed here, perform robustly and often have statistical power between those of the above two classes. Genet. Epidemiol. 35:606-619, 2011. (C) 2011 Wiley Periodicals, Inc.
引用
收藏
页码:606 / 619
页数:14
相关论文
共 51 条
[1]   Rare Variant Association Analysis Methods for Complex Traits [J].
Asimit, Jennifer ;
Zeggini, Eleftheria .
ANNUAL REVIEW OF GENETICS, VOL 44, 2010, 44 :293-308
[2]   Statistical analysis strategies for association studies involving rare variants [J].
Bansal, Vikas ;
Libiger, Ondrej ;
Torkamani, Ali ;
Schork, Nicholas J. .
NATURE REVIEWS GENETICS, 2010, 11 (11) :773-785
[3]   A Likelihood-Based Trait-Model-Free Approach for Linkage Detection of Binary Trait [J].
Basu, S. ;
Stephens, M. ;
Pankow, J. S. ;
Thompson, E. A. .
BIOMETRICS, 2010, 66 (01) :205-213
[4]   A Covering Method for Detecting Genetic Associations between Rare Variants and Common Phenotypes [J].
Bhatia, Gaurav ;
Bansal, Vikas ;
Harismendy, Olivier ;
Schork, Nicholas J. ;
Topol, Eric J. ;
Frazer, Kelly ;
Bafna, Vineet .
PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (10)
[5]   Common and rare variants in multifactorial susceptibility to common diseases [J].
Bodmer, Walter ;
Bonilla, Carolina .
NATURE GENETICS, 2008, 40 (06) :695-701
[6]   Analysis of multiple SNPs in a candidate gene or region [J].
Chapman, Juliet ;
Whittaker, John .
GENETIC EPIDEMIOLOGY, 2008, 32 (06) :560-566
[7]   A TWO-SAMPLE TEST FOR HIGH-DIMENSIONAL DATA WITH APPLICATIONS TO GENE-SET TESTING [J].
Chen, Song Xi ;
Qin, Ying-Li .
ANNALS OF STATISTICS, 2010, 38 (02) :808-835
[8]   Use of unphased multilocus genotype data in indirect association studies [J].
Clayton, D ;
Chapman, J ;
Cooper, J .
GENETIC EPIDEMIOLOGY, 2004, 27 (04) :415-428
[9]   So many correlated tests, so little time!: Rapid adjustment of P values for multiple correlated tests [J].
Conneely, Karen N. ;
Boehnke, Michael .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (06) :1158-1168
[10]   Genome association studies of complex diseases by case-control designs [J].
Fan, RZ ;
Knapp, M .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 72 (04) :850-868