Sailfish enables alignment-free isoform quantification from RNA-seq reads using lightweight algorithms

被引:392
作者
Patro, Rob [1 ]
Mount, Stephen M. [2 ,3 ]
Kingsford, Carl [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Lane Ctr Computat Biol, Pittsburgh, PA 15213 USA
[2] Univ Maryland, Dept Cell Biol & Mol Genet, College Pk, MD 20742 USA
[3] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
DIFFERENTIAL EXPRESSION; SEQUENCES;
D O I
10.1038/nbt.2862
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
We introduce Sailfish, a computational method for quantifying the abundance of previously annotated RNA isoforms from RNA-seq data. Because Sailfish entirely avoids mapping reads, a time-consuming step in all current methods, it provides quantification estimates much faster than do existing approaches (typically 20 times faster) without loss of accuracy. By facilitating frequent reanalysis of data and reducing the need to optimize parameters, Sailfish exemplifies the potential of lightweight algorithms for efficiently processing sequencing reads.
引用
收藏
页码:462 / U174
页数:6
相关论文
共 21 条
[1]  
Botelho F.C., 2007, P 10 INT WORKSH ALG, P139
[2]   Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments [J].
Bullard, James H. ;
Purdom, Elizabeth ;
Hansen, Kasper D. ;
Dudoit, Sandrine .
BMC BIOINFORMATICS, 2010, 11
[3]   Ensembl 2013 [J].
Flicek, Paul ;
Ahmed, Ikhlak ;
Amode, M. Ridwan ;
Barrell, Daniel ;
Beal, Kathryn ;
Brent, Simon ;
Carvalho-Silva, Denise ;
Clapham, Peter ;
Coates, Guy ;
Fairley, Susan ;
Fitzgerald, Stephen ;
Gil, Laurent ;
Garcia-Giron, Carlos ;
Gordon, Leo ;
Hourlier, Thibaut ;
Hunt, Sarah ;
Juettemann, Thomas ;
Kaehaeri, Andreas K. ;
Keenan, Stephen ;
Komorowska, Monika ;
Kulesha, Eugene ;
Longden, Ian ;
Maurel, Thomas ;
McLaren, William M. ;
Muffato, Matthieu ;
Nag, Rishi ;
Overduin, Bert ;
Pignatelli, Miguel ;
Pritchard, Bethan ;
Pritchard, Emily ;
Riat, Harpreet Singh ;
Ritchie, Graham R. S. ;
Ruffier, Magali ;
Schuster, Michael ;
Sheppard, Daniel ;
Sobral, Daniel ;
Taylor, Kieron ;
Thormann, Anja ;
Trevanion, Stephen ;
White, Simon ;
Wilder, Steven P. ;
Aken, Bronwen L. ;
Birney, Ewan ;
Cunningham, Fiona ;
Dunham, Ian ;
Harrow, Jennifer ;
Herrero, Javier ;
Hubbard, Tim J. P. ;
Johnson, Nathan ;
Kinsella, Rhoda .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D48-D55
[4]   Full-length transcriptome assembly from RNA-Seq data without a reference genome [J].
Grabherr, Manfred G. ;
Haas, Brian J. ;
Yassour, Moran ;
Levin, Joshua Z. ;
Thompson, Dawn A. ;
Amit, Ido ;
Adiconis, Xian ;
Fan, Lin ;
Raychowdhury, Raktima ;
Zeng, Qiandong ;
Chen, Zehua ;
Mauceli, Evan ;
Hacohen, Nir ;
Gnirke, Andreas ;
Rhind, Nicholas ;
di Palma, Federica ;
Birren, Bruce W. ;
Nusbaum, Chad ;
Lindblad-Toh, Kerstin ;
Friedman, Nir ;
Regev, Aviv .
NATURE BIOTECHNOLOGY, 2011, 29 (07) :644-U130
[5]   Modelling and simulating generic RNA-Seq experiments with the flux simulator [J].
Griebel, Thasso ;
Zacher, Benedikt ;
Ribeca, Paolo ;
Raineri, Emanuele ;
Lacroix, Vincent ;
Guigo, Roderic ;
Sammeth, Michael .
NUCLEIC ACIDS RESEARCH, 2012, 40 (20) :10073-10083
[6]   Ultrafast and memory-efficient alignment of short DNA sequences to the human genome [J].
Langmead, Ben ;
Trapnell, Cole ;
Pop, Mihai ;
Salzberg, Steven L. .
GENOME BIOLOGY, 2009, 10 (03)
[7]   RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome [J].
Li, Bo ;
Dewey, Colin N. .
BMC BIOINFORMATICS, 2011, 12
[8]   A fast, lock-free approach for efficient parallel counting of occurrences of k-mers [J].
Marcais, Guillaume ;
Kingsford, Carl .
BIOINFORMATICS, 2011, 27 (06) :764-770
[9]  
Mortazavi A., Nat Methods
[10]   Estimation of alternative splicing isoform frequencies from RNA-Seq data [J].
Nicolae, Marius ;
Mangul, Serghei ;
Mandoiu, Ion I. ;
Zelikovsky, Alex .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2011, 6