deML: robust demultiplexing of Illumina sequences using a likelihood-based approach

被引:157
作者
Renaud, Gabriel [1 ]
Stenzel, Udo [1 ]
Maricic, Tomislav [1 ]
Wiebe, Victor [1 ]
Kelso, Janet [1 ]
机构
[1] Max Planck Inst Evolutionary Anthropol, Dept Evolutionary Genet, D-04103 Leipzig, Saxony, Germany
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1093/bioinformatics/btu719
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Motivation: Pooling multiple samples increases the efficiency and lowers the cost of DNA sequencing. One approach to multiplexing is to use short DNA indices to uniquely identify each sample. After sequencing, reads must be assigned in silico to the sample of origin, a process referred to as demultiplexing. Demultiplexing software typically identifies the sample of origin using a fixed number of mismatches between the read index and a reference index set. This approach may fail or misassign reads when the sequencing quality of the indices is poor. Results: We introduce deML, a maximum likelihood algorithm that demultiplexes Illumina sequences. deML computes the likelihood of an observed index sequence being derived from a specified sample. A quality score which reflects the probability of the assignment being correct is generated for each read. Using these quality scores, even very problematic datasets can be demultiplexed and an error threshold can be set.
引用
收藏
页码:770 / 772
页数:3
相关论文
共 5 条
[1]
TagGD: Fast and Accurate Software for DNA Tag Generation and Demultiplexing [J].
Costea, Paul Igor ;
Lundeberg, Joakim ;
Akan, Pelin .
PLOS ONE, 2013, 8 (03)
[2]
Kraken: A set of tools for quality control and analysis Of high-throughput sequence data [J].
Davis, Matthew P. A. ;
van Dongen, Stijn ;
Abreu-Goodger, Cei ;
Bartonicek, Nenad ;
Enright, Anton J. .
METHODS, 2013, 63 (01) :41-49
[3]
Dodt Matthias, 2012, Biology (Basel), V1, P895, DOI 10.3390/biology1030895
[4]
Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform [J].
Kircher, Martin ;
Sawyer, Susanna ;
Meyer, Matthias .
NUCLEIC ACIDS RESEARCH, 2012, 40 (01) :e3
[5]
Launching genomics into the cloud: deployment of Mercury, a next generation sequence analysis pipeline [J].
Reid, Jeffrey G. ;
Carroll, Andrew ;
Veeraraghavan, Narayanan ;
Dahdouli, Mahmoud ;
Sundquist, Andreas ;
English, Adam ;
Bainbridge, Matthew ;
White, Simon ;
Salerno, William ;
Buhay, Christian ;
Yu, Fuli ;
Muzny, Donna ;
Daly, Richard ;
Duyk, Geoff ;
Gibbs, Richard A. ;
Boerwinkle, Eric .
BMC BIOINFORMATICS, 2014, 15