PolyTB: A genomic variation map for Mycobacterium tuberculosis

被引:72
作者
Coll, Francesc [1 ]
Preston, Mark [1 ]
Guerra-Assuncao, Jose Afonso [2 ]
Hill-Cawthorn, Grant [3 ,4 ,5 ]
Harris, David [6 ]
Perdigao, Joao [7 ]
Viveiros, Miguel [8 ]
Portugal, Isabel [7 ]
Drobniewski, Francis [9 ]
Gagneux, Sebastien [10 ]
Glynn, Judith R. [2 ]
Pain, Arnab [3 ]
Parkhill, Julian [6 ]
McNerney, Ruth [1 ]
Martin, Nigel [11 ]
Clark, Taane G. [1 ,2 ]
机构
[1] Univ London London Sch Hyg & Trop Med, Fac Infect & Trop Dis, London WC1E 7HT, England
[2] Univ London London Sch Hyg & Trop Med, Fac Epidemiol & Populat Hlth, London WC1E 7HT, England
[3] King Abdullah Univ Sci & Technol, Pathogen Genom Lab, Thuwal, Saudi Arabia
[4] Sydney Emerging Infect & Biosecur Inst, Sydney, NSW 2006, Australia
[5] Sch Publ Hlth, Sydney, NSW 2006, Australia
[6] Wellcome Trust Sanger Inst, Pathogen Genom Fac, Cambridge, England
[7] Univ Lisbon, Fac Farm, Ctr Patogenese Mol, P-1649003 Lisbon, Portugal
[8] Univ Nova Lisboa, Inst Higiene & Med Trop, Unidade Microbiol Med, Grp Micobacterias, P-1349008 Lisbon, Portugal
[9] Queen Mary Univ London, Ctr Immunol & Infect Dis, London E1 2AT, England
[10] Swiss Trop & Publ Hlth Inst, CH-4002 Basel, Switzerland
[11] Univ London Birkbeck Coll, Sch Comp Sci & Informat Syst, London WC1E 7HX, England
关键词
Mycobacterium tuberculosis; Database; Genomics; Software; Molecular epidemiology; Whole-genome sequencing; DRUG-RESISTANT TUBERCULOSIS; PAIRED-END; DIVERSITY; EPIDEMIOLOGY; RESOLUTION; ALGORITHM; EVOLUTION; INSIGHTS; PATIENT;
D O I
10.1016/j.tube.2014.02.005
中图分类号
R392 [医学免疫学]; Q939.91 [免疫学];
学科分类号
071005 [微生物学]; 100108 [医学免疫学];
摘要
Tuberculosis (TB) caused by Mycobacterium tuberculosis (Mtb) is the second major cause of death from an infectious disease worldwide. Recent advances in DNA sequencing are leading to the ability to generate whole genome information in clinical isolates of M. tuberculosis complex (MTBC). The identification of informative genetic variants such as phylogenetic markers and those associated with drug resistance or virulence will help barcode Mtb in the context of epidemiological, diagnostic and clinical studies. Mtb genomic datasets are increasingly available as raw sequences, which are potentially difficult and computer intensive to process, and compare across studies. Here we have processed the raw sequence data (> 1500 isolates, eight studies) to compile a catalogue of SNPs (n = 74,039, 63% non-synonymous, 51.1% in more than one isolate, i.e. non-private), small indels (n = 4810) and larger structural variants (n = 800). We have developed the PolyTB web-based tool (http://pathogenseq.lshtm.ac.uk/polytb) to visualise the resulting variation and important meta-data (e.g. in silico inferred strain-types, location) within geographical map and phylogenetic views. This resource will allow researchers to identify polymorphisms within candidate genes of interest, as well as examine the genomic diversity and distribution of strains. PolyTB source code is freely available to researchers wishing to develop similar tools for their pathogen of interest. (C) 2014 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:346 / 354
页数:9
相关论文
共 52 条
[1]
Tuberculosis 2013:5 Drug-resistant tuberculosis: time for visionary political leadership [J].
Abubakar, Ibrahim ;
Zignol, Matteo ;
Falzon, Dennis ;
Raviglione, Mario ;
Ditiu, Lucica ;
Masham, Susan ;
Adetifa, Lfedayo ;
Ford, Nathan ;
Cox, Helen ;
Lawn, Stephen D. ;
Marais, Ben J. ;
McHugh, Timothy D. ;
Mwaba, Peter ;
Bates, Matthew ;
Lipman, Marc ;
Zijenah, Lynn ;
Logan, Simon ;
McNerney, Ruth ;
Zumla, Adam ;
Sarda, Krishna ;
Nahid, Payam ;
Hoelscher, Michael ;
Pletschette, Michel ;
Memish, Ziad A. ;
Kim, Peter ;
Hafner, Richard ;
Cole, Stewart ;
Migliori, Giovanni Battista ;
Maeurer, Markus ;
Schito, Marco ;
Zumla, Alimuddin .
LANCET INFECTIOUS DISEASES, 2013, 13 (06) :529-539
[2]
CNVnator: An approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing [J].
Abyzov, Alexej ;
Urban, Alexander E. ;
Snyder, Michael ;
Gerstein, Mark .
GENOME RESEARCH, 2011, 21 (06) :974-984
[3]
AGE: defining breakpoints of genomic structural variants at single-nucleotide resolution, through optimal alignments with gap excision [J].
Abyzov, Alexej ;
Gerstein, Mark .
BIOINFORMATICS, 2011, 27 (05) :595-603
[4]
APPLICATIONS OF NEXT-GENERATION SEQUENCING Genome structural variation discovery and genotyping [J].
Alkan, Can ;
Coe, Bradley P. ;
Eichler, Evan E. .
NATURE REVIEWS GENETICS, 2011, 12 (05) :363-375
[5]
Role of large sequence polymorphisms (LSPs) in generating genomic diversity among clinical isolates of mycobacterium tuberculosis and the utility of LSPs in phylogenetic analysis [J].
Alland, David ;
Lacher, David W. ;
Hazbon, Man'zour Hernando ;
Motiwala, Alifiya S. ;
Qi, Weihong ;
Fleischmann, Robert D. ;
Whittam, Thomas S. .
JOURNAL OF CLINICAL MICROBIOLOGY, 2007, 45 (01) :39-46
[6]
[Anonymous], 2005, PHYLIP (phylogeny inference package) version 3.6
[7]
Combined Species Identification, Genotyping, and Drug Resistance Detection of Mycobacterium tuberculosis Cultures by MLPA on a Bead-Based Array [J].
Bergval, Indra ;
Sengstake, Sarah ;
Brankova, Nadia ;
Levterova, Viktoria ;
Abadia, Edgar ;
Tadumaze, Nino ;
Bablishvili, Nino ;
Akhalaia, Maka ;
Tuin, Kiki ;
Schuitema, Anja ;
Panaiotov, Stefan ;
Bachiyska, Elizabeta ;
Kantardjiev, Todor ;
de Zwaan, Rina ;
Schurch, Anita ;
van Soolingen, Dick ;
van 't Hoog, Anja ;
Cobelens, Frank ;
Aspindzelashvili, Rusudan ;
Sola, Christophe ;
Klatser, Paul ;
Anthony, Richard .
PLOS ONE, 2012, 7 (08)
[8]
Significance of the Identification in the Horn of Africa of an Exceptionally Deep Branching Mycobacterium tuberculosis Clade [J].
Blouin, Yann ;
Hauck, Yolande ;
Soler, Charles ;
Fabre, Michel ;
Vong, Rithy ;
Dehan, Celine ;
Cazajous, Geraldine ;
Massoure, Pierre-Laurent ;
Kraemer, Philippe ;
Jenkins, Akinbowale ;
Garnotel, Eric ;
Pourcel, Christine ;
Vergnaud, Gilles .
PLOS ONE, 2012, 7 (12)
[9]
Brosch R, 2004, PEDIAT PULMONOL, V38, P1
[10]
Inferring patient to patient transmission of Mycobacterium tuberculosis from whole genome sequencing data [J].
Bryant, Josephine M. ;
Schurch, Anita C. ;
van Deutekom, Henk ;
Harris, Simon R. ;
de Beer, Jessica L. ;
de Jager, Victor ;
Kremer, Kristin ;
van Hijum, Sacha A. F. T. ;
Siezen, Roland J. ;
Borgdorff, Martien ;
Bentley, Stephen D. ;
Parkhill, Julian ;
van Soolingen, Dick .
BMC INFECTIOUS DISEASES, 2013, 13