Generalized DNA Barcode Design Based on Hamming Codes

Cited by: 63
Authors
Bystrykh, Leonid V. [1]
Affiliations
[1] Univ Groningen, Univ Med Ctr Groningen, European Res Inst Biol Ageing, Sect Stem Cell Biol, Groningen, Netherlands
DOI
10.1371/journal.pone.0036852
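Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code
070301 [Inorganic Chemistry]; 070403 [Astrophysics]; 070507 [Natural Resources and Territorial Spatial Planning]; 090105 [Crop Production Systems and Ecological Engineering];
Abstract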
The diversity and scope of multiplex parallel sequencing applications are steadily increasing. Critically, multiplex parallel sequencing methods rely on barcoded primers for sample identification, and the quality of the barcodes directly affects the quality of the resulting sequence data. Inspection of recent publications reveals surprisingly variable quality in the barcodes employed. Some barcodes are made in a semi-empirical fashion, without quantitative consideration of error-correction or minimal-distance properties. After systematic comparison of published barcode sets, including commercially distributed barcoded primers from Illumina and Epicentre, methods for improved, Hamming code-based sequences are suggested and illustrated. Hamming barcodes can be employed for DNA tag designs in many different ways while preserving minimal distance and error-correcting properties. In addition, Hamming barcodes remain flexible with regard to essential biological parameters such as sequence redundancy and GC content. Wider adoption of improved Hamming barcodes is encouraged in multiplex parallel sequencing applications.
Pages: 8