Efficient de novo assembly of single-cell bacterial genomes from short-read data sets

Cited by: 165
Authors
Chitsaz, Hamidreza [2]
Yee-Greenbaum, Joyclyn L. [1]
Tesler, Glenn [3]
Lombardo, Mary-Jane [1]
Dupont, Christopher L. [1]
Badger, Jonathan H. [1]
Novotny, Mark [1]
Rusch, Douglas B. [4]
Fraser, Louise J. [5]
Gormley, Niall A. [5]
Schulz-Trieglaff, Ole [5]
Smith, Geoffrey P. [5]
Evers, Dirk J. [5]
Pevzner, Pavel A. [2]
Lasken, Roger S. [1]
Affiliations
[1] J Craig Venter Inst, San Diego, CA USA
[2] Univ Calif San Diego, Dept Comp Sci, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Dept Math, La Jolla, CA 92093 USA
[4] J Craig Venter Inst, Rockville, MD USA
[5] Illumina Cambridge Ltd, Saffron Walden, Essex, England
Funding
National Institutes of Health (NIH), USA
Keywords
AMPLIFICATION; POLYMERASE; SEQUENCE;
DOI
10.1038/nbt.1966
Chinese Library Classification (CLC) number
Q81 [Bioengineering (biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Whole genome amplification by the multiple displacement amplification (MDA) method allows sequencing of DNA from single cells of bacteria that cannot be cultured. Assembling a genome is challenging, however, because MDA generates highly nonuniform coverage of the genome. Here we describe an algorithm tailored for short-read data from single cells that improves assembly through the use of a progressively increasing coverage cutoff. Assembly of reads from single Escherichia coli and Staphylococcus aureus cells captures >91% of genes within contigs, approaching the 95% captured from an assembly based on many E. coli cells. We apply this method to assemble a genome from a single cell of an uncultivated SAR324 clade of Deltaproteobacteria, a cosmopolitan bacterial lineage in the global ocean. Metabolic reconstruction suggests that SAR324 is aerobic, motile and chemotaxic. Our approach enables acquisition of genome assemblies for individual uncultivated bacteria using only short reads, providing cell-specific genetic information absent from metagenomic studies.
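The idea of a progressively increasing coverage cutoff can be illustrated with a toy example. The sketch below is not the authors' implementation: the contig graph, the names (`Contig`, `prune`, `merge_unambiguous`, `assemble_fixed`, `assemble_progressive`) and the length-weighted averaging of coverage on merge are all assumptions made for illustration. It shows why raising the cutoff step by step, with merging after each step, can rescue a low-coverage genomic contig that a single fixed cutoff would discard.

```python
from dataclasses import dataclass


@dataclass
class Contig:
    length: int      # contig length in bases
    coverage: float  # average read depth over the contig


def prune(contigs, edges, cutoff):
    """Drop contigs whose average coverage is below the cutoff."""
    keep = {n: c for n, c in contigs.items() if c.coverage >= cutoff}
    return keep, {(u, v) for u, v in edges if u in keep and v in keep}


def merge_unambiguous(contigs, edges):
    """Merge u -> v when u has one successor and v has one predecessor.

    The merged contig gets a length-weighted average coverage
    (an assumption of this sketch).
    """
    contigs, edges = dict(contigs), set(edges)
    merged = True
    while merged:
        merged = False
        for u, v in sorted(edges):
            out_u = [e for e in edges if e[0] == u]
            in_v = [e for e in edges if e[1] == v]
            if u != v and len(out_u) == 1 and len(in_v) == 1:
                a, b = contigs[u], contigs[v]
                total = a.length + b.length
                cov = (a.length * a.coverage + b.length * b.coverage) / total
                contigs[u] = Contig(total, cov)
                del contigs[v]
                # Re-attach v's outgoing edges to the merged contig u.
                edges = {(u if x == v else x, y)
                         for x, y in edges if (x, y) != (u, v)}
                merged = True
                break
    return contigs, edges


def assemble_fixed(contigs, edges, cutoff):
    """Single pass: one fixed coverage cutoff, then merge."""
    c, e = merge_unambiguous(contigs, edges)
    c, e = prune(c, e, cutoff)
    return merge_unambiguous(c, e)[0]


def assemble_progressive(contigs, edges, max_cutoff):
    """Raise the cutoff one step at a time, merging after every step."""
    c, e = merge_unambiguous(contigs, edges)
    for cutoff in range(1, max_cutoff + 1):
        c, e = prune(c, e, cutoff)
        c, e = merge_unambiguous(c, e)
    return c


# Hypothetical graph: A -> B -> C is genomic, but B is poorly amplified
# (coverage 3). E1 and E2 are short error branches with coverage 1.
contigs = {
    "A": Contig(100, 20.0),
    "B": Contig(100, 3.0),
    "C": Contig(100, 30.0),
    "E1": Contig(20, 1.0),
    "E2": Contig(20, 1.0),
}
edges = {("A", "B"), ("A", "E1"), ("B", "C"), ("B", "E2")}

fixed = assemble_fixed(contigs, edges, cutoff=5)
progressive = assemble_progressive(contigs, edges, max_cutoff=5)

print("fixed:", fixed)
print("progressive:", progressive)
```

In this toy graph the fixed cutoff removes B together with the error branches and leaves A and C as separate contigs. The progressive version removes the error branches at a low cutoff first, so B merges with its high-coverage neighbors and the merged contig survives the later, higher cutoffs.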
Pages: 915 / U214
Page count: 8