Identity-by-descent filtering as a tool for the identification of disease alleles in exome sequence data from distant relatives

被引:4
作者
Nirmala Akula
Sevilla Detera-Wadleigh
Yin Yao Shugart
Michael Nalls
Jo Steele
Francis J McMahon
机构
[1] National Institutes of Health,Mood and Anxiety Section, Human Genetics Branch, National Institute of Mental Health
[2] National Institutes of Health,Unit of Statistical Genomics, National Institute of Mental Health
[3] National Institute on Aging,Molecular Genetics Section, Laboratory of Neurogenetics, Intramural Research Program
[4] National Institutes of Health,undefined
关键词
Locus Heterogeneity; Case Subject; Genetic Analysis Workshop; Allelic Heterogeneity; GAW17 Data;
D O I
10.1186/1753-6561-5-S9-S76
中图分类号
学科分类号
摘要
Large-scale, deep resequencing may be the next logical step in the genetic investigation of common complex diseases. Because each individual is likely to carry many thousands of variants, the identification of causal alleles requires an efficient strategy to reduce the number of candidate variants. Under many genetic models, causal alleles can be expected to reside within identity-by-descent (IBD) regions shared by affected relatives. In distant relatives, IBD regions constitute a small portion of the genome and can thus greatly reduce the search space for causal alleles. However, the effectiveness of this strategy is unknown. We test the simulated mini-exome data set in extended pedigrees provided by Genetic Analysis Workshop 17. At the fourth- and fifth-degree level of relatedness, case-case pairs shared between 1% and 9% of the genome identical by descent. As expected, no genes were shared identical by descent by all case subjects, but 43 genes were shared by many case subjects across at least 50 replicates. We filtered variants in these genes based on population frequency, function, informativeness, and evidence of association using the family-based association test. This analysis highlighted five genes previously implicated in triglyceride, lipid, and cholesterol metabolism. Comparison with the list of true risk alleles revealed that strict IBD filtering followed by association testing of the rarest alleles was the most sensitive strategy. IBD filtering may be a useful strategy for narrowing down the list of candidate variants in exome data, but the optimal degree of relatedness of affected pairs will depend on the genetic architecture of the disease under study.
引用
收藏
相关论文
共 89 条
[1]  
Maher B(2008)Personal genomes: the case of the missing heritability Nature 456 18-21
[2]  
Cirulli ET(2010)Uncovering the roles of rare variants in common disease through whole-genome sequencing Nat Rev Genet 11 415-425
[3]  
Goldstein DB(2010)Exome sequencing identifies the cause of a Mendelian disorder Nat Genet 42 30-35
[4]  
Ng SB(2010)Exome sequencing identifies Nat Genet 42 790-793
[5]  
Buckingham KJ(2010) mutations as a cause of Kabuki syndrome Nat Genet 42 827-829
[6]  
Lee C(2010)Identity-by-descent filtering of exome sequence data identifies PLoS Biol 8 e1000294-639
[7]  
Bigham AW(2010) mutations in hyperphosphatasia mental retardation syndrome Science 328 636-539
[8]  
Tabor HK(2011)Rare variants create synthetic genome-wide associations BMC Proc 5 S2-326
[9]  
Dent KM(2010)Analysis of genetic inheritance in a family quartet by whole-genome sequencing Am J Hum Genet 86 526-575
[10]  
Huff CD(2009)Genetic Analysis Workshop 17 mini-exome simulation Genome Res 19 318-233