Inference of high resolution HLA types using genome-wide RNA or DNA sequencing reads

Cited by: 63
Authors
Bai, Yu [1]
Ni, Min [1]
Cooper, Blerta [1]
Wei, Yi [1]
Fury, Wen [1]
Affiliations
[1] Regeneron Pharmaceuticals Inc., Tarrytown, NY 10591, USA
Keywords
HLA typing; Transcriptome sequencing; Exome sequencing; Whole genome sequencing; Hematopoietic transplantation; Autoimmune disease; Immuno-oncology; Human genetics; HUMAN-LEUKOCYTE ANTIGEN; SUSCEPTIBILITY; EXPRESSION; ALIGNMENT; DISEASE; ASSOCIATION; RESISTANCE; KNOWLEDGE
DOI
10.1186/1471-2164-15-325
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005 [Microbiology]; 090105 [Crop Production Systems and Ecological Engineering];
Abstract
Background: Accurate HLA typing at the amino acid level (four-digit resolution) is critical in hematopoietic and organ transplantation, in pathogenesis studies of autoimmune and infectious diseases, and in the development of immuno-oncology therapies. With the rapid adoption of genome-wide sequencing in biomedical research, HLA typing based on transcriptome and whole exome/genome sequencing data has become increasingly attractive because of its high throughput and convenience. However, unlike targeted amplicon sequencing, genome-wide sequencing often yields shorter reads and lower coverage, which makes it very difficult to resolve the highly homologous HLA alleles. Several algorithms exist and have been applied to four-digit typing, but some deliver low to moderate accuracy and others output ambiguous predictions. Moreover, few methods accommodate diverse read lengths and depths as well as both RNA and DNA sequencing inputs. New algorithms are therefore needed to improve the accuracy and flexibility of high-resolution HLA typing from genome-wide sequencing data.
Results: We have developed a new algorithm, PHLAT, that identifies the most probable pair of HLA alleles at four-digit resolution or higher by integrating candidate allele selection with likelihood scoring. Across a comprehensive set of benchmarking data (a total of 768 HLA alleles) from both RNA and DNA sequencing, spanning a broad range of read lengths and coverage, PHLAT consistently achieves high accuracy at four-digit (92%-95%) and two-digit (96%-99%) resolution, outperforming most existing methods. It also supports targeted amplicon sequencing data from Illumina MiSeq.
Conclusions: PHLAT substantially improves the accuracy and flexibility of high-resolution HLA typing based on genome-wide sequencing data. It may benefit basic and applied research in immunology and related fields, as well as numerous clinical applications.
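The abstract reports accuracy separately at four-digit and two-digit resolution. As a rough illustration of what that distinction means, the sketch below truncates allele names to a chosen number of colon-separated fields (two fields, such as A*02:01, correspond to four-digit resolution; one field, such as A*02, to two-digit resolution) and counts matching alleles per sample. The function names, the toy genotypes, and the per-allele, order-insensitive counting scheme are all assumptions made for illustration; this is not PHLAT's code and may differ from how the paper's benchmark was scored.

```python
from collections import Counter


def truncate(allele, fields):
    """Keep the gene name and the first `fields` colon-separated fields.

    truncate("A*02:01:01", 2) gives "A*02:01" (four-digit resolution);
    truncate("A*02:01:01", 1) gives "A*02" (two-digit resolution).
    """
    gene, _, rest = allele.partition("*")
    return gene + "*" + ":".join(rest.split(":")[:fields])


def matching_alleles(predicted, truth, fields):
    """Count alleles shared by two genotypes at the given resolution.

    Order within a genotype is ignored, and homozygous calls are handled
    by comparing multisets rather than sets.
    """
    p = Counter(truncate(a, fields) for a in predicted)
    t = Counter(truncate(a, fields) for a in truth)
    return sum((p & t).values())


def accuracy(calls, fields):
    """Fraction of true alleles recovered over (predicted, truth) pairs."""
    correct = sum(matching_alleles(p, t, fields) for p, t in calls)
    total = sum(len(t) for _, t in calls)
    return correct / total


# Hypothetical toy data: (predicted genotype, true genotype) for one locus each.
calls = [
    (("A*02:01:01", "A*24:02"), ("A*02:01", "A*24:03")),
    (("B*07:02", "B*44:02"), ("B*44:02", "B*07:02")),
]

four_digit = accuracy(calls, fields=2)
two_digit = accuracy(calls, fields=1)
```

In this toy data the A*24:02 call disagrees with the true A*24:03 at four-digit resolution but agrees at two-digit resolution, which is why accuracy at the coarser resolution can never be lower than at the finer one under this counting scheme.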
Pages: 16