Ultrafast and memory-efficient alignment of short DNA sequences to the human genome

被引:16373
作者
Langmead, Ben [1 ]
Trapnell, Cole [1 ]
Pop, Mihai [1 ]
Salzberg, Steven L. [1 ]
机构
[1] Univ Maryland, Ctr Bioinformat & Computat Biol, Inst Adv Comp Studies, College Pk, MD 20742 USA
来源
GENOME BIOLOGY | 2009年 / 10卷 / 03期
基金
美国国家科学基金会;
关键词
IDENTIFICATION; SPACE;
D O I
10.1186/gb-2009-10-3-r25
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Bowtie is an ultrafast, memory-efficient alignment program for aligning short DNA sequence reads to large genomes. For the human genome, Burrows-Wheeler indexing allows Bowtie to align more than 25 million reads per CPU hour with a memory footprint of approximately 1.3 gigabytes. Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches. Multiple processor cores can be used simultaneously to achieve even greater alignment speeds. Bowtie is open source http://bowtie.cbcb.umd.edu.
引用
收藏
页数:10
相关论文
共 30 条
[1]  
[Anonymous], SHRIMP SHORT READ MA
[2]   Fast and practical approximate string matching [J].
BaezaYates, RA ;
Perleberg, CH .
INFORMATION PROCESSING LETTERS, 1996, 59 (01) :21-27
[3]   Accurate whole human genome sequencing using reversible terminator chemistry [J].
Bentley, David R. ;
Balasubramanian, Shankar ;
Swerdlow, Harold P. ;
Smith, Geoffrey P. ;
Milton, John ;
Brown, Clive G. ;
Hall, Kevin P. ;
Evers, Dirk J. ;
Barnes, Colin L. ;
Bignell, Helen R. ;
Boutell, Jonathan M. ;
Bryant, Jason ;
Carter, Richard J. ;
Cheetham, R. Keira ;
Cox, Anthony J. ;
Ellis, Darren J. ;
Flatbush, Michael R. ;
Gormley, Niall A. ;
Humphray, Sean J. ;
Irving, Leslie J. ;
Karbelashvili, Mirian S. ;
Kirk, Scott M. ;
Li, Heng ;
Liu, Xiaohai ;
Maisinger, Klaus S. ;
Murray, Lisa J. ;
Obradovic, Bojan ;
Ost, Tobias ;
Parkinson, Michael L. ;
Pratt, Mark R. ;
Rasolonjatovo, Isabelle M. J. ;
Reed, Mark T. ;
Rigatti, Roberto ;
Rodighiero, Chiara ;
Ross, Mark T. ;
Sabot, Andrea ;
Sankar, Subramanian V. ;
Scally, Aylwyn ;
Schroth, Gary P. ;
Smith, Mark E. ;
Smith, Vincent P. ;
Spiridou, Anastassia ;
Torrance, Peta E. ;
Tzonev, Svilen S. ;
Vermaas, Eric H. ;
Walter, Klaudia ;
Wu, Xiaolin ;
Zhang, Lu ;
Alam, Mohammed D. ;
Anastasi, Carole .
NATURE, 2008, 456 (7218) :53-59
[4]  
*BOWT, BOWT ULTR MEM EFF SH
[5]  
Burkhardt S, 2003, FUND INFORM, V56, P51
[6]  
Burrows M, 1994, BLOCK SORTING LOSSLE
[7]   Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing [J].
Campbell, Peter J. ;
Stephens, Philip J. ;
Pleasance, Erin D. ;
O'Meara, Sarah ;
Li, Heng ;
Santarius, Thomas ;
Stebbings, Lucy A. ;
Leroy, Catherine ;
Edkins, Sarah ;
Hardy, Claire ;
Teague, Jon W. ;
Menzies, Andrew ;
Goodhead, Ian ;
Turner, Daniel J. ;
Clee, Christopher M. ;
Quail, Michael A. ;
Cox, Antony ;
Brown, Clive ;
Durbin, Richard ;
Hurles, Matthew E. ;
Edwards, Paul A. W. ;
Bignell, Graham R. ;
Stratton, Michael R. ;
Futreal, P. Andrew .
NATURE GENETICS, 2008, 40 (06) :722-729
[8]   SeqAn An efficient, generic C++ library for sequence analysis [J].
Doering, Andreas ;
Weese, David ;
Rausch, Tobias ;
Reinert, Knut .
BMC BIOINFORMATICS, 2008, 9 (1)
[9]   A Bayesian deconvolution strategy for immunoprecipitation-based DNA methylome analysis [J].
Down, Thomas A. ;
Rakyan, Vardhman K. ;
Turner, Daniel J. ;
Flicek, Paul ;
Li, Heng ;
Kulesha, Eugene ;
Graf, Stefan ;
Johnson, Nathan ;
Herrero, Javier ;
Tomazou, Eleni M. ;
Thorne, Natalie P. ;
Backdahl, Liselotte ;
Herberth, Marlis ;
Howe, Kevin L. ;
Jackson, David K. ;
Miretti, Marcos M. ;
Marioni, John C. ;
Birney, Ewan ;
Hubbard, Tim J. P. ;
Durbin, Richard ;
Tavare, Simon ;
Beck, Stephan .
NATURE BIOTECHNOLOGY, 2008, 26 (07) :779-785
[10]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194