De novo assembly and analysis of RNA-seq data

被引:651
作者
Robertson, Gordon [1 ]
Schein, Jacqueline [1 ]
Chiu, Readman [1 ]
Corbett, Richard [1 ]
Field, Matthew [1 ]
Jackman, Shaun D. [1 ]
Mungall, Karen [1 ]
Lee, Sam [2 ]
Okada, Hisanaga Mark [1 ]
Qian, Jenny Q. [1 ]
Griffith, Malachi [1 ]
Raymond, Anthony [1 ]
Thiessen, Nina [1 ]
Cezard, Timothee [1 ]
Butterfield, Yaron S. [1 ]
Newsome, Richard [1 ]
Chan, Simon K. [1 ]
She, Rong [1 ]
Varhol, Richard [1 ]
Kamoh, Baljit [1 ]
Prabhu, Anna-Liisa [1 ]
Tam, Angela [1 ]
Zhao, YongJun [1 ]
Moore, Richard A. [1 ]
Hirst, Martin [1 ]
Marra, Marco A. [1 ,3 ]
Jones, Steven J. M. [1 ,3 ]
Hoodless, Pamela A. [2 ,3 ]
Birol, Inanc [1 ]
机构
[1] British Columbia Canc Agcy, Canadas Michael Smith Genome Sci Ctr, Vancouver, BC V5Z 4E6, Canada
[2] British Columbia Canc Agcy, Terry Fox Lab, Vancouver, BC V5Z 1L3, Canada
[3] Univ British Columbia, Dept Med Genet, Vancouver, BC, Canada
关键词
SPLICE JUNCTIONS; EXPRESSION; ALIGNMENT; TRANSCRIPTS; GENERATION; REVEALS; GENOMES; READS; RATES;
D O I
10.1038/nmeth.1517
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We describe Trans-ABySS, a de novo short-read transcriptome assembly and analysis pipeline that addresses variation in local read densities by assembling read substrings with varying stringencies and then merging the resulting contigs before analysis. Analyzing 7.4 gigabases of 50-base-pair paired-end Illumina reads from an adult mouse liver poly(A) RNA library, we identified known, new and alternative structures in expressed transcripts, and achieved high sensitivity and specificity relative to reference-based assembly methods.
引用
收藏
页码:909 / U62
页数:7
相关论文
共 32 条
[1]   Global and unbiased detection of splice junctions from RNA-seq data [J].
Ameur, Adam ;
Wetterbom, Anna ;
Feuk, Lars ;
Gyllensten, Ulf .
GENOME BIOLOGY, 2010, 11 (03)
[2]   Detection of splice junctions from paired-end RNA-seq data by SpliceMap [J].
Au, Kin Fai ;
Jiang, Hui ;
Lin, Lan ;
Xing, Yi ;
Wong, Wing Hung .
NUCLEIC ACIDS RESEARCH, 2010, 38 (14) :4570-4578
[3]   De novo transcriptome assembly with ABySS [J].
Birol, Inanc ;
Jackman, Shaun D. ;
Nielsen, Cydney B. ;
Qian, Jenny Q. ;
Varhol, Richard ;
Stazyk, Greg ;
Morin, Ryan D. ;
Zhao, Yongjun ;
Hirst, Martin ;
Schein, Jacqueline E. ;
Horsman, Doug E. ;
Connors, Joseph M. ;
Gascoyne, Randy D. ;
Marra, Marco A. ;
Jones, Steven J. M. .
BIOINFORMATICS, 2009, 25 (21) :2872-2877
[4]   Into the unknown: expression profiling without genome sequence information in CHO by next generation sequencing [J].
Birzele, Fabian ;
Schaub, Jochen ;
Rust, Werner ;
Clemens, Christoph ;
Baum, Patrick ;
Kaufmann, Hitto ;
Weith, Andreas ;
Schulz, Torsten W. ;
Hildebrandt, Tobias .
NUCLEIC ACIDS RESEARCH, 2010, 38 (12) :3999-4010
[5]   Problems and Solutions for Estimating Indel Rates and Length Distributions [J].
Cartwright, Reed A. .
MOLECULAR BIOLOGY AND EVOLUTION, 2009, 26 (02) :473-480
[6]   Optimal spliced alignments of short sequence reads [J].
De Bona, Fabio ;
Ossowski, Stephan ;
Schneeberger, Korbinian ;
Raetsch, Gunnar .
BIOINFORMATICS, 2008, 24 (16) :I174-I180
[7]   Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data [J].
Degner, Jacob F. ;
Marioni, John C. ;
Pai, Athma A. ;
Pickrell, Joseph K. ;
Nkadori, Everlyne ;
Gilad, Yoav ;
Pritchard, Jonathan K. .
BIOINFORMATICS, 2009, 25 (24) :3207-3212
[8]  
Flicek P, 2009, NAT METHODS, V6, pS6, DOI [10.1038/NMETH.1376, 10.1038/nmeth.1376]
[9]  
Griffith M, 2010, NAT METHODS, V7, P843, DOI [10.1038/NMETH.1503, 10.1038/nmeth.1503]
[10]   Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs [J].
Guttman, Mitchell ;
Garber, Manuel ;
Levin, Joshua Z. ;
Donaghey, Julie ;
Robinson, James ;
Adiconis, Xian ;
Fan, Lin ;
Koziol, Magdalena J. ;
Gnirke, Andreas ;
Nusbaum, Chad ;
Rinn, John L. ;
Lander, Eric S. ;
Regev, Aviv .
NATURE BIOTECHNOLOGY, 2010, 28 (05) :503-U166