Kraken: ultrafast metagenomic sequence classification using exact alignments

被引:2975
作者
Wood, Derrick E. [1 ,2 ,3 ]
Salzberg, Steven L. [3 ,4 ]
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
[2] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
[3] Johns Hopkins Univ, Sch Med, McKusick Nathans Inst Genet Med, Ctr Computat Biol, Baltimore, MD USA
[4] Johns Hopkins Univ, Dept Biostat, Bloomberg Sch Publ Hlth, Baltimore, MD 21205 USA
基金
美国国家卫生研究院;
关键词
metagenomics; sequence classification; sequence alignment; next-generation sequencing; microbiome;
D O I
10.1186/gb-2014-15-3-r46
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Kraken is an ultrafast and highly accurate program for assigning taxonomic labels to metagenomic DNA sequences. Previous programs designed for this task have been relatively slow and computationally expensive, forcing researchers to use faster abundance estimation programs, which only classify small subsets of metagenomic data. Using exact alignment of k-mers, Kraken achieves classification accuracy comparable to the fastest BLAST program. In its fastest mode, Kraken classifies 100 base pair reads at a rate of over 4.1 million reads per minute, 909 times faster than Megablast and 11 times faster than the abundance estimation program MetaPhlAn. Kraken is available at http://ccb.jhu.edu/software/kraken/.
引用
收藏
页数:12
相关论文
共 24 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]   Scalable metagenomic taxonomy classification using a reference genome database [J].
Ames, Sasha K. ;
Hysom, David A. ;
Gardner, Shea N. ;
Lloyd, G. Scott ;
Gokhale, Maya B. ;
Allen, Jonathan E. .
BIOINFORMATICS, 2013, 29 (18) :2253-2260
[3]   PhymmBL expanded: confidence scores, custom databases, parallelization and more [J].
Brady, Arthur ;
Salzberg, Steven .
NATURE METHODS, 2011, 8 (05) :367-367
[4]  
Brady A, 2009, NAT METHODS, V6, P673, DOI [10.1038/nmeth.1358, 10.1038/NMETH.1358]
[5]   BLAST plus : architecture and applications [J].
Camacho, Christiam ;
Coulouris, George ;
Avagyan, Vahram ;
Ma, Ning ;
Papadopoulos, Jason ;
Bealer, Kevin ;
Madden, Thomas L. .
BMC BIOINFORMATICS, 2009, 10
[6]   ECOLOGY OF HAEMOPHILUS-INFLUENZAE AND HAEMOPHILUS-PARAINFLUENZAE IN SPUTUM AND SALIVA AND EFFECTS OF ANTIBIOTICS ON THEIR DISTRIBUTION IN PATIENTS WITH LOWER RESPIRATORY-TRACT INFECTIONS [J].
FOWERAKER, JE ;
COOKE, NJ ;
HAWKEY, PM .
ANTIMICROBIAL AGENTS AND CHEMOTHERAPY, 1993, 37 (04) :804-809
[7]  
Holtgrewe M, MASON
[8]   MEGAN analysis of metagenomic data [J].
Huson, Daniel H. ;
Auch, Alexander F. ;
Qi, Ji ;
Schuster, Stephan C. .
GENOME RESEARCH, 2007, 17 (03) :377-386
[9]   Structure, function and diversity of the healthy human microbiome [J].
Huttenhower, Curtis ;
Gevers, Dirk ;
Knight, Rob ;
Abubucker, Sahar ;
Badger, Jonathan H. ;
Chinwalla, Asif T. ;
Creasy, Heather H. ;
Earl, Ashlee M. ;
FitzGerald, Michael G. ;
Fulton, Robert S. ;
Giglio, Michelle G. ;
Hallsworth-Pepin, Kymberlie ;
Lobos, Elizabeth A. ;
Madupu, Ramana ;
Magrini, Vincent ;
Martin, John C. ;
Mitreva, Makedonka ;
Muzny, Donna M. ;
Sodergren, Erica J. ;
Versalovic, James ;
Wollam, Aye M. ;
Worley, Kim C. ;
Wortman, Jennifer R. ;
Young, Sarah K. ;
Zeng, Qiandong ;
Aagaard, Kjersti M. ;
Abolude, Olukemi O. ;
Allen-Vercoe, Emma ;
Alm, Eric J. ;
Alvarado, Lucia ;
Andersen, Gary L. ;
Anderson, Scott ;
Appelbaum, Elizabeth ;
Arachchi, Harindra M. ;
Armitage, Gary ;
Arze, Cesar A. ;
Ayvaz, Tulin ;
Baker, Carl C. ;
Begg, Lisa ;
Belachew, Tsegahiwot ;
Bhonagiri, Veena ;
Bihan, Monika ;
Blaser, Martin J. ;
Bloom, Toby ;
Bonazzi, Vivien ;
Brooks, J. Paul ;
Buck, Gregory A. ;
Buhay, Christian J. ;
Busam, Dana A. ;
Campbell, Joseph L. .
NATURE, 2012, 486 (7402) :207-214
[10]   Salivary proteins promote proteolytic activity in Streptococcus mitis biovar 2 and Streptococcus mutans [J].
Kindblom, C. ;
Davies, J. R. ;
Herzberg, M. C. ;
Svensater, G. ;
Wickstrom, C. .
MOLECULAR ORAL MICROBIOLOGY, 2012, 27 (05) :362-372