Imputing Amino Acid Polymorphisms in Human Leukocyte Antigens

被引:226
作者
Jia, Xiaoming [1 ]
Han, Buhm [2 ,3 ,4 ]
Onengut-Gumuscu, Suna [5 ]
Chen, Wei-Min [5 ]
Concannon, Patrick J. [5 ]
Rich, Stephen S. [5 ]
Raychaudhuri, Soumya [2 ,3 ,4 ,6 ,7 ]
de Bakker, Paul I. W. [2 ,3 ,8 ,9 ]
机构
[1] Harvard MIT Massachusetts Inst Technol, Div Hlth Sci & Technol, Boston, MA USA
[2] Harvard Univ, Brigham & Womens Hosp, Sch Med, Div Genet, Boston, MA 02115 USA
[3] Broad Inst Harvard & MIT, Program Med & Populat Genet, Cambridge, MA USA
[4] Harvard Univ, Brigham & Womens Hosp, Sch Med, Div Rheumatol, Boston, MA 02115 USA
[5] Univ Virginia, Sch Med, Charlottesville, VA 22908 USA
[6] Partners HealthCare Ctr Personalized Genet Med, Boston, MA USA
[7] Univ Manchester, Fac Med & Human Sci, Manchester, Lancs, England
[8] Univ Med Ctr Utrecht, Dept Epidemiol, Utrecht, Netherlands
[9] Univ Med Ctr Utrecht, Dept Med Genet, Utrecht, Netherlands
基金
英国医学研究理事会; 美国国家卫生研究院; 英国惠康基金;
关键词
GENOME-WIDE ASSOCIATION; MAJOR HISTOCOMPATIBILITY COMPLEX; CLASSICAL HLA ALLELES; EXTENDED HUMAN MHC; GENOTYPE IMPUTATION; DISEASE ASSOCIATION; COMMON DISEASES; HAPLOTYPE; SUSCEPTIBILITY; INFERENCE;
D O I
10.1371/journal.pone.0064683
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
070301 [无机化学]; 070403 [天体物理学]; 070507 [自然资源与国土空间规划学]; 090105 [作物生产系统与生态工程];
摘要
DNA sequence variation within human leukocyte antigen (HLA) genes mediate susceptibility to a wide range of human diseases. The complex genetic structure of the major histocompatibility complex (MHC) makes it difficult, however, to collect genotyping data in large cohorts. Long-range linkage disequilibrium between HLA loci and SNP markers across the major histocompatibility complex (MHC) region offers an alternative approach through imputation to interrogate HLA variation in existing GWAS data sets. Here we describe a computational strategy, SNP2HLA, to impute classical alleles and amino acid polymorphisms at class I (HLA-A, -B, -C) and class II (-DPA1, -DPB1, -DQA1, -DQB1, and -DRB1) loci. To characterize performance of SNP2HLA, we constructed two European ancestry reference panels, one based on data collected in HapMap-CEPH pedigrees (90 individuals) and another based on data collected by the Type 1 Diabetes Genetics Consortium (T1DGC, 5,225 individuals). We imputed HLA alleles in an independent data set from the British 1958 Birth Cohort (N = 918) with gold standard four-digit HLA types and SNPs genotyped using the Affymetrix GeneChip 500 K and Illumina Immunochip microarrays. We demonstrate that the sample size of the reference panel, rather than SNP density of the genotyping platform, is critical to achieve high imputation accuracy. Using the larger T1DGC reference panel, the average accuracy at four-digit resolution is 94.7% using the low-density Affymetrix GeneChip 500 K, and 96.7% using the high-density Illumina Immunochip. For amino acid polymorphisms within HLA genes, we achieve 98.6% and 99.3% accuracy using the Affymetrix GeneChip 500 K and Illumina Immunochip, respectively. Finally, we demonstrate how imputation and association testing at amino acid resolution can facilitate fine-mapping of primary MHC association signals, giving a specific example from type 1 diabetes.
引用
收藏
页数:10
相关论文
共 40 条
[1]
Amino acid position 11 of HLA-DRβ1 is a major determinant of chromosome 6p association with ulcerative colitis [J].
Achkar, J-P ;
Klei, L. ;
de Bakker, P. I. W. ;
Bellone, G. ;
Rebert, N. ;
Scott, R. ;
Lu, Y. ;
Regueiro, M. ;
Brzezinski, A. ;
Kamboh, M. I. ;
Fiocchi, C. ;
Devlin, B. ;
Trucco, M. ;
Ringquist, S. ;
Roeder, K. ;
Duerr, R. H. .
GENES AND IMMUNITY, 2012, 13 (03) :245-252
[2]
Drug Hypersensitivity and Human Leukocyte Antigens of the Major Histocompatibility Complex [J].
Bharadwaj, Mandvi ;
Illing, Patricia ;
Theodossis, Alex ;
Purcell, AnthonyW. ;
Rossjohn, Jamie ;
McCluskey, James .
ANNUAL REVIEW OF PHARMACOLOGY AND TOXICOLOGY, VOL 52, 2012, 52 :401-+
[3]
Common and rare variants in multifactorial susceptibility to common diseases [J].
Bodmer, Walter ;
Bonilla, Carolina .
NATURE GENETICS, 2008, 40 (06) :695-701
[4]
A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals [J].
Browning, Brian L. ;
Browning, Sharon R. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2009, 84 (02) :210-223
[5]
Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls [J].
Burton, Paul R. ;
Clayton, David G. ;
Cardon, Lon R. ;
Craddock, Nick ;
Deloukas, Panos ;
Duncanson, Audrey ;
Kwiatkowski, Dominic P. ;
McCarthy, Mark I. ;
Ouwehand, Willem H. ;
Samani, Nilesh J. ;
Todd, John A. ;
Donnelly, Peter ;
Barrett, Jeffrey C. ;
Davison, Dan ;
Easton, Doug ;
Evans, David ;
Leung, Hin-Tak ;
Marchini, Jonathan L. ;
Morris, Andrew P. ;
Spencer, Chris C. A. ;
Tobin, Martin D. ;
Attwood, Antony P. ;
Boorman, James P. ;
Cant, Barbara ;
Everson, Ursula ;
Hussey, Judith M. ;
Jolley, Jennifer D. ;
Knight, Alexandra S. ;
Koch, Kerstin ;
Meech, Elizabeth ;
Nutland, Sarah ;
Prowse, Christopher V. ;
Stevens, Helen E. ;
Taylor, Niall C. ;
Walters, Graham R. ;
Walker, Neil M. ;
Watkins, Nicholas A. ;
Winzer, Thilo ;
Jones, Richard W. ;
McArdle, Wendy L. ;
Ring, Susan M. ;
Strachan, David P. ;
Pembrey, Marcus ;
Breen, Gerome ;
St Clair, David ;
Caesar, Sian ;
Gordon-Smith, Katherine ;
Jones, Lisa ;
Fraser, Christine ;
Green, Elain K. .
NATURE, 2007, 447 (7145) :661-678
[6]
The influence of HLA genotype on AIDS [J].
Carrington, M ;
O'Brien, SJ .
ANNUAL REVIEW OF MEDICINE, 2003, 54 :535-551
[7]
A correlation between the relative predisposition of MHC class II alleles to type 1 diabetes and the structure of their proteins [J].
Cucca, F ;
Lampis, R ;
Congia, M ;
Angius, E ;
Nutland, S ;
Bain, SC ;
Barnett, AH ;
Todd, JA .
HUMAN MOLECULAR GENETICS, 2001, 10 (19) :2025-2037
[8]
A high-resolution HLA and SNP haplotype map for disease association studies in the extended human MHC [J].
de Bakker, Paul I. W. ;
McVean, Gil ;
Sabeti, Pardis C. ;
Miretti, Marcos M. ;
Green, Todd ;
Marchini, Jonathan ;
Ke, Xiayi ;
Monsuur, Alienke J. ;
Whittaker, Pamela ;
Delgado, Marcos ;
Morrison, Jonathan ;
Richardson, Angela ;
Walsh, Emily C. ;
Gao, Xiaojiang ;
Galver, Luana ;
Hart, John ;
Hafler, David A. ;
Pericak-Vance, Margaret ;
Todd, John A. ;
Daly, Mark J. ;
Trowsdale, John ;
Wijmenga, Cisca ;
Vyse, Tim J. ;
Beck, Stephan ;
Murray, Sarah Shaw ;
Carrington, Mary ;
Gregory, Simon ;
Deloukas, Panos ;
Rioux, John D. .
NATURE GENETICS, 2006, 38 (10) :1166-1172
[9]
Interrogating the major histocompatibility complex with high-throughput genomics [J].
de Bakker, Paul I. W. ;
Raychaudhuri, Soumya .
HUMAN MOLECULAR GENETICS, 2012, 21 :R29-R36
[10]
HLA*IMP-an integrated framework for imputing classical HLA alleles from SNP genotypes [J].
Dilthey, Alexander T. ;
Moutsianas, Loukas ;
Leslie, Stephen ;
McVean, Gil .
BIOINFORMATICS, 2011, 27 (07) :968-972