A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae

被引:220
作者
Nookaew, Intawat [1 ]
Papini, Marta [1 ]
Pornputtapong, Natapol [1 ]
Scalcinati, Gionata [1 ]
Fagerberg, Linn [2 ]
Uhlen, Matthias [2 ,3 ]
Nielsen, Jens [1 ,3 ]
机构
[1] Chalmers Univ Technol, Novo Nordisk Fdn Ctr Biosustainabil, Dept Chem & Biol Engn, SE-41296 Gothenburg, Sweden
[2] Royal Inst Technol, Novo Nordisk Fdn Ctr Biosustainabil, Dept Biotechnol, SE-10691 Stockholm, Sweden
[3] Tech Univ Denmark, Novo Nordisk Fdn Ctr Biosustainabil, DK-2970 Horsholm, Denmark
基金
欧洲研究理事会;
关键词
MESSENGER-RNA; GENOME; ALIGNMENT; QUANTIFICATION; METABOLISM; ALGORITHMS; LANDSCAPE; FRAMEWORK; GENOTYPE; MODEL;
D O I
10.1093/nar/gks804
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
RNA-seq, has recently become an attractive method of choice in the studies of transcriptomes, promising several advantages compared with microarrays. In this study, we sought to assess the contribution of the different analytical steps involved in the analysis of RNA-seq data generated with the Illumina platform, and to perform a cross-platform comparison based on the results obtained through Affymetrix microarray. As a case study for our work we, used the Saccharomyces cerevisiae strain CEN.PK 113-7D, grown under two different conditions (batch and chemostat). Here, we asses the influence of genetic variation on the estimation of gene expression level using three different aligners for read-mapping (Gsnap, Stampy and TopHat) on S288c genome, the capabilities of five different statistical methods to detect differential gene expression (baySeq, Cuffdiff, DESeq, edgeR and NOISeq) and we explored the consistency between RNA-seq analysis using reference genome and de novo assembly approach. High reproducibility among biological replicates (correlation >= 0.99) and high consistency between the two platforms for analysis of gene expression levels (correlation >= 0.91) are reported. The results from differential gene expression identification derived from the different statistical methods, as well as their integrated analysis results based on gene ontology annotation are in good agreement. Overall, our study provides a useful and comprehensive comparison between the two platforms (RNA-seq and microrrays) for gene expression analysis and addresses the contribution of the different steps involved in the analysis of RNA-seq data.
引用
收藏
页码:10084 / 10097
页数:14
相关论文
共 58 条
[51]   EFFECT OF BENZOIC-ACID ON METABOLIC FLUXES IN YEASTS - A CONTINUOUS-CULTURE STUDY ON THE REGULATION OF RESPIRATION AND ALCOHOLIC FERMENTATION [J].
VERDUYN, C ;
POSTMA, E ;
SCHEFFERS, WA ;
VANDIJKEN, JP .
YEAST, 1992, 8 (07) :501-517
[52]   RNA-Seq: a revolutionary tool for transcriptomics [J].
Wang, Zhong ;
Gerstein, Mark ;
Snyder, Michael .
NATURE REVIEWS GENETICS, 2009, 10 (01) :57-63
[53]   Defining transcribed regions using RNA-seq [J].
Wilhelm, Brian T. ;
Marguerat, Samuel ;
Goodhead, Ian ;
Bahler, Jurg .
NATURE PROTOCOLS, 2010, 5 (02) :255-266
[54]   RNA-Seq-quantitative measurement of expression through massively parallel RNA-sequencing [J].
Wilhelm, Brian T. ;
Landry, Josette-Renee .
METHODS, 2009, 48 (03) :249-257
[55]  
Workman C., 2002, Genome Biol, V3, P0048
[56]   GMAP: a genomic mapping and alignment program for mRNA and EST sequences [J].
Wu, TD ;
Watanabe, CK .
BIOINFORMATICS, 2005, 21 (09) :1859-1875
[57]   Fast and SNP-tolerant detection of complex variants and splicing in short reads [J].
Wu, Thomas D. ;
Nacu, Serban .
BIOINFORMATICS, 2010, 26 (07) :873-881
[58]   Optimizing de novo transcriptome assembly from short-read RNA-Seq data: a comparative study [J].
Zhao, Qiong-Yi ;
Wang, Yi ;
Kong, Yi-Meng ;
Luo, Da ;
Li, Xuan ;
Hao, Pei .
BMC BIOINFORMATICS, 2011, 12 :S2