Efficient oligonucleotide probe selection for pan-genomic tiling arrays

被引:15
作者
Phillippy, Adam M. [1 ]
Deng, Xiangyu [2 ]
Zhang, Wei [2 ]
Salzberg, Steven L. [1 ]
机构
[1] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
[2] IIT, Natl Ctr Food Safety & Technol, Summit Argo, IL 60501 USA
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
LISTERIA-MONOCYTOGENES; CGH DATA; IDENTIFICATION; DESIGN; POLYMORPHISMS; SEGMENTATION; VIRULENCE; LINEAGES; STRAINS;
D O I
10.1186/1471-2105-10-293
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Array comparative genomic hybridization is a fast and cost-effective method for detecting, genotyping, and comparing the genomic sequence of unknown bacterial isolates. This method, as with all microarray applications, requires adequate coverage of probes targeting the regions of interest. An unbiased tiling of probes across the entire length of the genome is the most flexible design approach. However, such a whole-genome tiling requires that the genome sequence is known in advance. For the accurate analysis of uncharacterized bacteria, an array must query a fully representative set of sequences from the species' pan-genome. Prior microarrays have included only a single strain per array or the conserved sequences of gene families. These arrays omit potentially important genes and sequence variants from the pan-genome. Results: This paper presents a new probe selection algorithm (PanArray) that can tile multiple whole genomes using a minimal number of probes. Unlike arrays built on clustered gene families, PanArray uses an unbiased, probe-centric approach that does not rely on annotations, gene clustering, or multi-alignments. Instead, probes are evenly tiled across all sequences of the pan-genome at a consistent level of coverage. To minimize the required number of probes, probes conserved across multiple strains in the pan-genome are selected first, and additional probes are used only where necessary to span polymorphic regions of the genome. The viability of the algorithm is demonstrated by array designs for seven different bacterial pan-genomes and, in particular, the design of a 385,000 probe array that fully tiles the genomes of 20 different Listeria monocytogenes strains with overlapping probes at greater than twofold coverage. Conclusion: PanArray is an oligonucleotide probe selection algorithm for tiling multiple genome sequences using a minimal number of probes. It is capable of fully tiling all genomes of a species on a single microarray chip. These unique pan-genome tiling arrays provide maximum flexibility for the analysis of both known and uncharacterized strains.
引用
收藏
页数:11
相关论文
共 42 条
[1]   Direct selection of human genomic loci by microarray hybridization [J].
Albert, Thomas J. ;
Molla, Michael N. ;
Muzny, Donna M. ;
Nazareth, Lynne ;
Wheeler, David ;
Song, Xingzhi ;
Richmond, Todd A. ;
Middle, Chris M. ;
Rodesch, Matthew J. ;
Packard, Charles J. ;
Weinstock, George M. ;
Gibbs, Richard A. .
NATURE METHODS, 2007, 4 (11) :903-905
[2]  
[Anonymous], 1999, COMPLEXITY APPROXIMA, DOI DOI 10.1007/978-3-642-58412-1
[3]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theory of NP-Completeness
[4]   NCBI GEO: archive for high-throughput functional genomic data [J].
Barrett, Tanya ;
Troup, Dennis B. ;
Wilhite, Stephen E. ;
Ledoux, Pierre ;
Rudnev, Dmitry ;
Evangelista, Carlos ;
Kim, Irene F. ;
Soboleva, Alexandra ;
Tomashevsky, Maxim ;
Marshall, Kimberly A. ;
Phillippy, Katherine H. ;
Sherman, Patti M. ;
Muertter, Rolf N. ;
Edgar, Ron .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D885-D890
[5]   Design optimization methods for genomic DNA tiling arrays [J].
Bertone, P ;
Trifonov, V ;
Rozowsky, JS ;
Schubert, F ;
Emanuelsson, O ;
Karro, J ;
Kao, MY ;
Snyder, M ;
Gerstein, M .
GENOME RESEARCH, 2006, 16 (02) :271-281
[6]   Selective discrimination of Listeria monocytogenes epidemic strains by a mixed-genome DNA microarray compared to discrimination by pulsed-field gel electrophoresis, ribotyping, and multilocus sequence typing [J].
Borucki, MK ;
Kim, SH ;
Call, DR ;
Smole, SC ;
Pagotto, F .
JOURNAL OF CLINICAL MICROBIOLOGY, 2004, 42 (11) :5270-5276
[7]   Mixed-genome microarrays reveal multiple serotype and lineage-specific differences among strains of Listeria monocytogenes [J].
Call, DR ;
Borucki, MK ;
Besser, TE .
JOURNAL OF CLINICAL MICROBIOLOGY, 2003, 41 (02) :632-639
[8]   Design of long oligonucleotide probes for functional gene detection in a microbial community [J].
Chung, WH ;
Rhee, SK ;
Bae, JW ;
Quan, ZX ;
Park, YH .
BIOINFORMATICS, 2005, 21 (22) :4092-4100
[9]   New aspects regarding evolution and virulence of Listeria monocytogenes revealed by comparative genomics and DNA arrays [J].
Doumith, M ;
Cazalet, C ;
Simoes, N ;
Frangeul, L ;
Jacquet, C ;
Kunst, F ;
Martin, P ;
Cossart, P ;
Glaser, P ;
Buchrieser, C .
INFECTION AND IMMUNITY, 2004, 72 (02) :1072-1083
[10]   LISTERIA-MONOCYTOGENES, A FOOD-BORNE PATHOGEN [J].
FARBER, JM ;
PETERKIN, PI .
MICROBIOLOGICAL REVIEWS, 1991, 55 (03) :476-511