Improving draft assemblies by iterative mapping and assembly of short reads to eliminate gaps

被引:225
作者
Tsai, Isheng J. [1 ]
Otto, Thomas D. [1 ]
Berriman, Matthew [1 ]
机构
[1] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
来源
GENOME BIOLOGY | 2010年 / 11卷 / 04期
关键词
SEQUENCE DATA; PROGRAM;
D O I
10.1186/gb-2010-11-4-r41
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Advances in sequencing technology allow genomes to be sequenced at vastly decreased costs. However, the assembled data frequently are highly fragmented with many gaps. We present a practical approach that uses Illumina sequences to improve draft genome assemblies by aligning sequences against contig ends and performing local assemblies to produce gap-spanning contigs. The continuity of a draft genome can thus be substantially improved, often without the need to generate new data.
引用
收藏
页数:9
相关论文
共 21 条
[11]  
Hillier L, 1991, PCR Methods Appl, V1, P124
[12]   The Sequence Alignment/Map format and SAMtools [J].
Li, Heng ;
Handsaker, Bob ;
Wysoker, Alec ;
Fennell, Tim ;
Ruan, Jue ;
Homer, Nils ;
Marth, Gabor ;
Abecasis, Goncalo ;
Durbin, Richard .
BIOINFORMATICS, 2009, 25 (16) :2078-2079
[13]   De novo assembly of human genomes with massively parallel short read sequencing [J].
Li, Ruiqiang ;
Zhu, Hongmei ;
Ruan, Jue ;
Qian, Wubin ;
Fang, Xiaodong ;
Shi, Zhongbin ;
Li, Yingrui ;
Li, Shengting ;
Shan, Gao ;
Kristiansen, Karsten ;
Li, Songgang ;
Yang, Huanming ;
Wang, Jian ;
Wang, Jun .
GENOME RESEARCH, 2010, 20 (02) :265-272
[14]   ALLPATHS 2: small genomes assembled accurately and with high continuity from short paired reads [J].
MacCallum, Iain ;
Przybylski, Dariusz ;
Gnerre, Sante ;
Burton, Joshua ;
Shlyakhter, Ilya ;
Gnirke, Andreas ;
Malek, Joel ;
McKernan, Kevin ;
Ranade, Swati ;
Shea, Terrance P. ;
Williams, Louise ;
Young, Sarah ;
Nusbaum, Chad ;
Jaffe, David B. .
GENOME BIOLOGY, 2009, 10 (10)
[15]   The phusion assembler [J].
Mullikin, JC ;
Ning, ZM .
GENOME RESEARCH, 2003, 13 (01) :81-90
[16]   SSAHA: A fast search method for large DNA databases [J].
Ning, ZM ;
Cox, AJ ;
Mullikin, JC .
GENOME RESEARCH, 2001, 11 (10) :1725-1729
[17]   De novo assembly using low-coverage short read sequence data from the rice pathogen Pseudomonas syringae pv. oryzae [J].
Reinhardt, Josephine A. ;
Baltrus, David A. ;
Nishimura, Marc T. ;
Jeck, William R. ;
Jones, Corbin D. ;
Dangl, Jeffery L. .
GENOME RESEARCH, 2009, 19 (02) :294-305
[18]   ABySS: A parallel assembler for short read sequence data [J].
Simpson, Jared T. ;
Wong, Kim ;
Jackman, Shaun D. ;
Schein, Jacqueline E. ;
Jones, Steven J. M. ;
Birol, Inanc .
GENOME RESEARCH, 2009, 19 (06) :1117-1123
[19]  
*WA U GEN SEQ CTR, 1998, GENOME RES, V8, P1097
[20]   Velvet: Algorithms for de novo short read assembly using de Bruijn graphs [J].
Zerbino, Daniel R. ;
Birney, Ewan .
GENOME RESEARCH, 2008, 18 (05) :821-829