SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing

Cited by: 17842
Authors
Bankevich, Anton [2]
Nurk, Sergey [2]
Antipov, Dmitry [2]
Gurevich, Alexey A. [2]
Dvorkin, Mikhail [2]
Kulikov, Alexander S. [2, 3]
Lesin, Valery M. [2]
Nikolenko, Sergey I. [2, 3]
Pham, Son [4]
Prjibelski, Andrey D. [2]
Pyshkin, Alexey V. [2]
Sirotkin, Alexander V. [2]
Vyahhi, Nikolay [2]
Tesler, Glenn [5]
Alekseyev, Max A. [1, 2]
Pevzner, Pavel A. [2, 4]
Affiliations
[1] Univ S Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA
[2] St Petersburg Acad Univ, Russian Acad Sci, Algorithm Biol Lab, St Petersburg, Russia
[3] VA Steklov Math Inst, St Petersburg 191011, Russia
[4] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[5] Univ Calif San Diego, Dept Math, La Jolla, CA 92093 USA
Funding
National Institutes of Health (USA)
Keywords
assembly; de Bruijn graph; single cell; sequencing; bacteria; DE-BRUIJN GRAPHS; BACTERIAL GENOMES; AMPLIFICATION; MATTER
DOI
10.1089/cmb.2012.0021
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online (http://bioinf.spbau.ru/spades). It is distributed as open source software.
Pages: 455-477
Page count: 23
Related papers
40 in total
  • [1] A new approach to sequence comparison: normalized sequence alignment
    Arslan, AN
    Egecioglu, Ö
    Pevzner, PA
    [J]. BIOINFORMATICS, 2001, 17 (04) : 327 - 337
  • [2] Shotgun protein sequencing - Assembly of peptide tandem mass spectra from mixtures of modified proteins
    Bandeira, Nuno
    Clauser, Karl R.
    Pevzner, Pavel A.
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2007, 6 (07) : 1123 - 1134
  • [3] Automated de novo protein sequencing of monoclonal antibodies
    Bandeira, Nuno
    Pham, Victoria
    Pevzner, Pavel
    Arnott, David
    Lill, Jennie R.
    [J]. NATURE BIOTECHNOLOGY, 2008, 26 (12) : 1336 - 1338
  • [4] Genome of a Low-Salinity Ammonia-Oxidizing Archaeon Determined by Single-Cell and Metagenomic Analysis
    Blainey, Paul C.
    Mosier, Annika C.
    Potanina, Anastasia
    Francis, Christopher A.
    Quake, Stephen R.
    [J]. PLOS ONE, 2011, 6 (02):
  • [5] ALLPATHS: De novo assembly of whole-genome shotgun microreads
    Butler, Jonathan
    MacCallum, Iain
    Kleber, Michael
    Shlyakhter, Ilya A.
    Belmonte, Matthew K.
    Lander, Eric S.
    Nusbaum, Chad
    Jaffe, David B.
    [J]. GENOME RESEARCH, 2008, 18 (05) : 810 - 820
  • [6] Short read fragment assembly of bacterial genomes
    Chaisson, Mark J.
    Pevzner, Pavel A.
    [J]. GENOME RESEARCH, 2008, 18 (02) : 324 - 330
  • [7] De novo fragment assembly with short mate-paired reads: Does the read length matter?
    Chaisson, Mark J.
    Brinza, Dumitru
    Pevzner, Pavel A.
    [J]. GENOME RESEARCH, 2009, 19 (02) : 336 - 346
  • [8] Chikhi Rayan, 2011, Algorithms in Bioinformatics. Proceedings of the 11th International Workshop, WABI 2011, P39, DOI 10.1007/978-3-642-23038-7_4
  • [9] Efficient de novo assembly of single-cell bacterial genomes from short-read data sets
    Chitsaz, Hamidreza
    Yee-Greenbaum, Joyclyn L.
    Tesler, Glenn
    Lombardo, Mary-Jane
    Dupont, Christopher L.
    Badger, Jonathan H.
    Novotny, Mark
    Rusch, Douglas B.
    Fraser, Louise J.
    Gormley, Niall A.
    Schulz-Trieglaff, Ole
    Smith, Geoffrey P.
    Evers, Dirk J.
    Pevzner, Pavel A.
    Lasken, Roger S.
    [J]. NATURE BIOTECHNOLOGY, 2011, 29 (10) : 915 - U214
  • [10] Single-cell dissection of transcriptional heterogeneity in human colon tumors
    Dalerba, Piero
    Kalisky, Tomer
    Sahoo, Debashis
    Rajendran, Pradeep S.
    Rothenberg, Michael E.
    Leyrat, Anne A.
    Sim, Sopheak
    Okamoto, Jennifer
    Johnston, Darius M.
    Qian, Dalong
    Zabala, Maider
    Bueno, Janet
    Neff, Norma F.
    Wang, Jianbin
    Shelton, Andrew A.
    Visser, Brendan
    Hisamori, Shigeo
    Shimono, Yohei
    van de Wetering, Marc
    Clevers, Hans
    Clarke, Michael F.
    Quake, Stephen R.
    [J]. NATURE BIOTECHNOLOGY, 2011, 29 (12) : 1120 - U11