High resolution clustering of Salmonella enterica serovar Montevideo strains using a next-generation sequencing approach

被引:109
作者
Allard, Marc W. [1 ]
Luo, Yan [2 ]
Strain, Errol [2 ]
Li, Cong [1 ]
Keys, Christine E. [1 ]
Son, Insook [1 ]
Stones, Robert [3 ]
Musser, Steven M. [1 ]
Brown, Eric W. [1 ]
机构
[1] US FDA, Off Regulatory Sci, Ctr Food Safety & Appl Nutr, College Pk, MD 20740 USA
[2] US FDA, Off Food Def Commun & Emergency Response, Ctr Food Safety & Appl Nutr, College Pk, MD 20740 USA
[3] Food & Environm Res Agcy, York YO41 1LZ, N Yorkshire, England
关键词
FIELD GEL-ELECTROPHORESIS; ESCHERICHIA-COLI O157-H7; VARIABLE NUMBER; HIGH-THROUGHPUT; SEROTYPES; DNA; PHYLOGENY; EVOLUTION; ALIGNMENT; ACCURACY;
D O I
10.1186/1471-2164-13-32
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 [微生物学]; 090105 [作物生产系统与生态工程];
摘要
Background: Next-Generation Sequencing (NGS) is increasingly being used as a molecular epidemiologic tool for discerning ancestry and traceback of the most complicated, difficult to resolve bacterial pathogens. Making a linkage between possible food sources and clinical isolates requires distinguishing the suspected pathogen from an environmental background and placing the variation observed into the wider context of variation occurring within a serovar and among other closely related foodborne pathogens. Equally important is the need to validate these high resolution molecular tools for use in molecular epidemiologic traceback. Such efforts include the examination of strain cluster stability as well as the cumulative genetic effects of sub-culturing on these clusters. Numerous isolates of S. Montevideo were shot-gun sequenced including diverse lineage representatives as well as numerous replicate clones to determine how much variability is due to bias, sequencing error, and or the culturing of isolates. All new draft genomes were compared to 34 S. Montevideo isolates previously published during an NGS-based molecular epidemiological case study. Results: Intraserovar lineages of S. Montevideo differ by thousands of SNPs, that are only slightly less than the number of SNPs observed between S. Montevideo and other distinct serovars. Much less variability was discovered within an individual S. Montevideo clade implicated in a recent foodborne outbreak as well as among individual NGS replicates. These findings were similar to previous reports documenting homopolymeric and deletion error rates with the Roche 454 GS Titanium technology. In no case, however, did variability associated with sequencing methods or sample preparations create inconsistencies with our current phylogenetic results or the subsequent molecular epidemiological evidence gleaned from these data. Conclusions: Implementation of a validated pipeline for NGS data acquisition and analysis provides highly reproducible results that are stable and predictable for molecular epidemiological applications. When draft genomes are collected at 15x-20x coverage and passed through a quality filter as part of a data analysis pipeline, including sub-passaged replicates defined by a few SNPs, they can be accurately placed in a phylogenetic context. This reproducibility applies to all levels within and between serovars of Salmonella suggesting that investigators using these methods can have confidence in their conclusions.
引用
收藏
页数:18
相关论文
共 39 条
[1]
[Anonymous], 2006, GENETIC ALGORITHM AP
[2]
SALMONELLA REFERENCE COLLECTION-B (SARB) - STRAINS OF 37 SEROVARS OF SUBSPECIES-I [J].
BOYD, EF ;
WANG, FS ;
BELTRAN, P ;
PLOCK, SA ;
NELSON, K ;
SELANDER, RK .
JOURNAL OF GENERAL MICROBIOLOGY, 1993, 139 :1125-1132
[3]
Forensics and mitochondrial DNA: Applications, debates, and foundations [J].
Budowle, B ;
Allard, MW ;
Wilson, MR ;
Chakraborty, R .
ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, 2003, 4 :119-141
[4]
The Economics of Enteric Infections: Human Foodborne Disease Costs [J].
Buzby, Jean C. ;
Roberts, Tanya .
GASTROENTEROLOGY, 2009, 136 (06) :1851-1862
[5]
Molecular applications for identifying microbial pathogens in the post-9/11 era [J].
Cebula, TA ;
Brown, EW ;
Jackson, SA ;
Mammel, MK ;
Mukherjee, A ;
LeClerc, JE .
EXPERT REVIEW OF MOLECULAR DIAGNOSTICS, 2005, 5 (03) :431-445
[6]
Chin C., 2010, NEW ENGL J MED, V364, p[1, 33]
[7]
Mauve: Multiple alignment of conserved genomic sequence with rearrangements [J].
Darling, ACE ;
Mau, B ;
Blattner, FR ;
Perna, NT .
GENOME RESEARCH, 2004, 14 (07) :1394-1403
[8]
Evaluation of pulsed-field gel electrophoresis as a tool for determining the degree of genetic relatedness between strains of Escherichia coli O157:H7 [J].
Davis, MA ;
Hancock, DD ;
Besser, TE ;
Call, DR .
JOURNAL OF CLINICAL MICROBIOLOGY, 2003, 41 (05) :1843-1849
[9]
A Whole-Genome Single Nucleotide Polymorphism-Based Approach To Trace and Identify Outbreaks Linked to a Common Salmonella enterica subsp enterica Serovar Montevideo Pulsed-Field Gel Electrophoresis Type [J].
den Bakker, Henk C. ;
Switt, Andrea I. Moreno ;
Cummings, Craig A. ;
Hoelzer, Karin ;
Degoricija, Lovorka ;
Rodriguez-Rivera, Lorraine D. ;
Wright, Emily M. ;
Fang, Rixun ;
Davis, Margaret ;
Root, Tim ;
Schoonmaker-Bopp, Dianna ;
Musser, Kimberlee A. ;
Villamil, Elizabeth ;
Waechter, HaeNa ;
Kornstein, Laura ;
Furtado, Manohar R. ;
Wiedmann, Martin .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2011, 77 (24) :8648-8655
[10]
MUSCLE: multiple sequence alignment with high accuracy and high throughput [J].
Edgar, RC .
NUCLEIC ACIDS RESEARCH, 2004, 32 (05) :1792-1797