Optimal Unified Approach for Rare-Variant Association Testing with Application to Small-Sample Case-Control Whole-Exome Sequencing Studies

被引:750
作者
Lee, Seunggeun [1 ]
Emond, Mary J. [2 ]
Bamshad, Michael J. [3 ,5 ]
Barnes, Kathleen C. [4 ]
Rieder, Mark J. [5 ]
Nickerson, Deborah A. [5 ]
Christiani, David C. [6 ,7 ]
Wurfel, Mark M. [8 ]
Lin, Xihong [1 ]
机构
[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[2] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[3] Univ Washington, Dept Pediat, Seattle, WA 98195 USA
[4] Johns Hopkins Univ, Dept Med, Baltimore, MD 21224 USA
[5] Univ Washington, Dept Genom Sci, Seattle, WA 98195 USA
[6] Harvard Univ, Sch Publ Hlth, Dept Environm Hlth, Boston, MA 02115 USA
[7] Harvard Univ, Sch Publ Hlth, Dept Epidemiol, Boston, MA 02115 USA
[8] Univ Washington, Div Pulm & Crit Care Med, Seattle, WA 98104 USA
关键词
CHAIN-KINASE GENE; ACUTE LUNG INJURY; COMMON DISEASES; STRATEGIES; FRAMEWORK; RISK;
D O I
10.1016/j.ajhg.2012.06.007
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We propose in this paper a unified approach for testing the association between rare variants and phenotypes in sequencing association studies. This approach maximizes power by adaptively using the data to optimally combine the burden test and the nonburden sequence kernel association test (SKAT). Burden tests are more powerful when most variants in a region are causal and the effects are in the same direction, whereas SKAT is more powerful when a large fraction of the variants in a region are noncausal or the effects of causal variants are in different directions. The proposed unified test maintains the power in both scenarios. We show that the unified test corresponds to the optimal test in an extended family of SKAT tests, which we refer to as SKAT-O. The second goal of this paper is to develop a small-sample adjustment procedure for the proposed methods for the correction of conservative type I error rates of SKAT family tests when the trait of interest is dichotomous and the sample size is small. Both small-sample-adjusted SKAT and the optimal unified test (SKAT-O) are computationally efficient and can easily be applied to genome-wide sequencing association studies. We evaluate the finite sample performance of the proposed methods using extensive simulation studies and illustrate their application using the acute-lung-injury exome-sequencing data of the National Heart, Lung, and Blood Institute Exome Sequencing Project.
引用
收藏
页码:224 / 237
页数:14
相关论文
共 37 条
[1]   A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[2]   Medical sequencing at the extremes of human body mass [J].
Ahituv, Nadav ;
Kavaslar, Nihan ;
Schackwitz, Wendy ;
Ustaszewska, Anna ;
Martin, Joel ;
Hebert, Sybil ;
Doelle, Heather ;
Ersoy, Baran ;
Kryukov, Gregory ;
Schmidt, Steffen ;
Yosef, Nir ;
Ruppin, Eytan ;
Sharan, Roded ;
Vaisse, Christian ;
Sunyaev, Shamil ;
Dent, Robert ;
Cohen, Jonathan ;
McPherson, Ruth ;
Pennacchio, Len A. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 80 (04) :779-791
[3]  
[Anonymous], 1999, Bootstrap methods and their application
[4]   The American-European Consensus Conference on ARDS, Part 2 - Ventilatory, pharmacologic, supportive therapy, study design strategies, and issues related to recovery and remodeling [J].
Artigas, A ;
Bernard, GR ;
Carlet, J ;
Dreyfuss, D ;
Gattinoni, L ;
Hudson, L ;
Lamy, M ;
Marini, JJ ;
Matthay, MA ;
Pinsky, MR ;
Spragg, R ;
Suter, PM ;
Blanch, L ;
Burchardi, H ;
Hedenstierna, C ;
Lemaire, F ;
Roussos, C ;
Mancebo, J ;
Morris, A ;
Pesenti, A ;
Rossi, A ;
Van Asbeck, BS ;
Brigham, KL ;
Dhainaut, JF ;
Fowler, AA ;
Hyers, TM ;
Morel, D ;
Rodriguez-Roisin, R ;
Schaller, MD ;
Hemmer, M ;
Torres, A ;
Villar, J ;
Vincent, JL ;
Leeper, K ;
Meyrick, B ;
Oppenheimer, L ;
Reid, L ;
Murray, JF ;
Bihari, D ;
Bosken, C ;
Goris, J ;
Johanson, WJ ;
Lanken, PN ;
Le Gall, JR ;
Morris, AH ;
Rinaldo, J ;
Pattishal, EN .
AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 1998, 157 (04) :1332-1347
[5]   Comparison of Statistical Tests for Disease Association With Rare Variants [J].
Basu, Saonli ;
Pan, Wei .
GENETIC EPIDEMIOLOGY, 2011, 35 (07) :606-619
[6]   Common and rare variants in multifactorial susceptibility to common diseases [J].
Bodmer, Walter ;
Bonilla, Carolina .
NATURE GENETICS, 2008, 40 (06) :695-701
[7]   Variation in the myosin light chain kinase gene is associated with development of acute lung injury after major trauma [J].
Christie, Jason D. ;
Ma, Shwu-Fan ;
Aplenc, Richard ;
Li, Mingyao ;
Lanken, Paul N. ;
Shah, Chirag V. ;
Fuchs, Barry ;
Albelda, Steven M. ;
Flores, Carlos ;
Garcia, Joe G. N. .
CRITICAL CARE MEDICINE, 2008, 36 (10) :2794-2800
[8]   Multiple rare Alleles contribute to low plasma levels of HDL cholesterol [J].
Cohen, JC ;
Kiss, RS ;
Pertsemlidis, A ;
Marcel, YL ;
McPherson, R ;
Hobbs, HH .
SCIENCE, 2004, 305 (5685) :869-872
[9]  
Davies R. B., 2018, J. Royal Stat. Soc. Series C: Appl. Stat, V29, P323, DOI 10.2307/2346911
[10]   Computing the distribution of quadratic forms: Further comparisons between the Liu-Tang-Zhang approximation and exact methods [J].
Duchesne, Pierre ;
De Micheaux, Pierre Lafaye .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (04) :858-862