TagGD: Fast and Accurate Software for DNA Tag Generation and Demultiplexing

被引:38
作者
Costea, Paul Igor [1 ]
Lundeberg, Joakim [1 ]
Akan, Pelin [1 ]
机构
[1] KTH Royal Inst Technol, Sci Life Lab, Sch Biotechnol, Solna, Sweden
关键词
BARCODED PRIMERS; DISSECTION; ENHANCERS; DESIGN;
D O I
10.1371/journal.pone.0057521
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
070301 [无机化学]; 070403 [天体物理学]; 070507 [自然资源与国土空间规划学]; 090105 [作物生产系统与生态工程];
摘要
Multiplexing is of vital importance for utilizing the full potential of next generation sequencing technologies. We here report TagGD (DNA-based Tag Generator and Demultiplexor), a fully-customisable, fast and accurate software package that can generate thousands of barcodes satisfying user-defined constraints and can guarantee full demultiplexing accuracy. The barcodes are designed to minimise their interference with the experiment. Insertion, deletion and substitution events are considered when designing and demultiplexing barcodes. 20,000 barcodes of length 18 were designed in 5 minutes and 2 million barcoded Illumina HiSeq-like reads generated with an error rate of 2% were demultiplexed with full accuracy in 5 minutes. We believe that our software meets a central demand in the current high-throughput biology and can be utilised in any field with ample sample abundance. The software is available on GitHub (https://github.com/pelinakan/UBD.git).
引用
收藏
页数:5
相关论文
共 15 条
[1]
[Anonymous], 2007, BOUNDS MINIMUM DISTA
[2]
Butenhof DR., 1997, Programming with POSIX Threads
[3]
Generalized DNA Barcode Design Based on Hamming Codes [J].
Bystrykh, Leonid V. .
PLOS ONE, 2012, 7 (05)
[4]
OpenMP: An industry standard API for shared-memory programming [J].
Dagum, L ;
Menon, R .
IEEE COMPUTATIONAL SCIENCE & ENGINEERING, 1998, 5 (01) :46-55
[5]
DNA Sudoku-harnessing high-throughput sequencing for multiplexed specimen analysis [J].
Erlich, Yaniv ;
Chang, Kenneth ;
Gordon, Assaf ;
Ronen, Roy ;
Navon, Oron ;
Rooks, Michelle ;
Hannon, Gregory J. .
GENOME RESEARCH, 2009, 19 (07) :1243-1253
[6]
BARCRAWL and BARTAB: software tools for the design and implementation of barcoded primers for highly multiplexed DNA sequencing [J].
Frank, Daniel N. .
BMC BIOINFORMATICS, 2009, 10 :362
[7]
Error-correcting barcoded primers for pyrosequencing hundreds of samples in multiplex [J].
Hamady, Micah ;
Walker, Jeffrey J. ;
Harris, J. Kirk ;
Gold, Nicholas J. ;
Knight, Rob .
NATURE METHODS, 2008, 5 (03) :235-237
[8]
Levenshtein V.I., 1966, Soviet Physics Doklady
[9]
Clusters of Nucleotide Substitutions and Insertion/Deletion Mutations Are Associated with Repeat Sequences [J].
McDonald, Michael J. ;
Wang, Wei-Chi ;
Huang, Hsien-Da ;
Leu, Jun-Yi .
PLOS BIOLOGY, 2011, 9 (06)
[10]
Systematic dissection and optimization of inducible enhancers in human cells using a massively parallel reporter assay [J].
Melnikov, Alexandre ;
Murugan, Anand ;
Zhang, Xiaolan ;
Tesileanu, Tiberiu ;
Wang, Li ;
Rogov, Peter ;
Feizi, Soheil ;
Gnirke, Andreas ;
Callan, Curtis G., Jr. ;
Kinney, Justin B. ;
Kellis, Manolis ;
Lander, Eric S. ;
Mikkelsen, Tarjei S. .
NATURE BIOTECHNOLOGY, 2012, 30 (03) :271-+