Extending assembly of short DNA sequences to handle error

Cited by: 141
Authors
Jeck, William R.
Reinhardt, Josephine A.
Baltrus, David A.
Hickenbotham, Matthew T.
Magrini, Vincent
Mardis, Elaine R.
Dangl, Jeffery L.
Jones, Corbin D.
Affiliations
[1] Univ N Carolina, Dept Biol, Chapel Hill, NC 27599 USA
[2] Washington Univ, Sch Med, Dept Genet, St Louis, MO 63108 USA
[3] Univ N Carolina, Carolina Ctr Genome Sci, Chapel Hill, NC 27599 USA
DOI
10.1093/bioinformatics/btm451
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Inexpensive de novo genome sequencing, particularly in organisms with small genomes, is now possible using several new sequencing technologies. Some of these technologies, such as Illumina's Solexa Sequencing, produce high genomic coverage by generating a very large number of small reads (~30 bp). While prior work shows that partial assembly can be performed by k-mer extension in error-free reads, this algorithm is unsuccessful with the sequencing error rates found in practice. We present VCAKE (Verified Consensus Assembly by K-mer Extension), a modification of simple k-mer extension that overcomes error by using high depth coverage. Though it is a simple modification of a previous approach, we show significant improvements in assembly results on simulated and experimental datasets that include error.
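The idea of extending a contig one base at a time while letting overlapping reads vote on the next base can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `extend_right`, the parameters `k`, `min_votes`, and `min_frac`, and the exact-overlap rule are all assumptions made for illustration, and VCAKE's actual thresholds are not stated in the abstract.

```python
from collections import Counter


def extend_right(seed, reads, k=6, min_votes=2, min_frac=0.6, max_len=1000):
    """Greedily extend `seed` to the right, one base per step.

    Hypothetical sketch: every read whose prefix (at least k bases long)
    exactly matches the current end of the contig votes for the base that
    follows the overlap. The contig grows only when the winning base has
    enough votes and a clear majority, so an isolated erroneous read is
    outvoted rather than followed.
    """
    contig = seed
    while len(contig) < max_len:
        votes = Counter()
        for read in reads:
            # Try the longest overlap first; each read casts at most one vote.
            longest = min(len(read) - 1, len(contig))
            for overlap in range(longest, k - 1, -1):
                if contig.endswith(read[:overlap]):
                    votes[read[overlap]] += 1
                    break
        if not votes:
            break  # no read overlaps the contig end
        base, count = votes.most_common(1)[0]
        total = sum(votes.values())
        if count < min_votes or count / total < min_frac:
            break  # too little support, or the reads disagree
        contig += base
    return contig
```

With deep, mostly correct coverage, a read carrying a wrong final base is outvoted, and a read with an error inside its overlap region simply fails to match and casts no vote. Plain first-match extension, by contrast, could follow whichever read it happened to find first.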
Pages: 2942-2944
Page count: 3
Related papers
4 items in total
  • [1] Whole-genome re-sequencing
    Bentley, David R.
    [J]. CURRENT OPINION IN GENETICS & DEVELOPMENT, 2006, 16 (06) : 545 - 552
  • [2] Whole-Genome Sequencing and Assembly with High-Throughput, Short-Read Technologies
    Sundquist, Andreas
    Ronaghi, Mostafa
    Tang, Haixu
    Pevzner, Pavel
    Batzoglou, Serafim
    [J]. PLOS ONE, 2007, 2 (05):
  • [3] Assembling millions of short DNA sequences using SSAKE
    Warren, Rene L.
    Sutton, Granger G.
    Jones, Steven J. M.
    Holt, Robert A.
    [J]. BIOINFORMATICS, 2007, 23 (04) : 500 - 501
  • [4] An analysis of the feasibility of short read sequencing (art. no. E171)
    Whiteford, N
    Haslam, N
    Weber, G
    Prügel-Bennett, A
    Essex, JW
    Roach, PL
    Bradley, M
    Neylon, C
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 (19) : 1 - 6