A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae

被引:220
作者
Nookaew, Intawat [1 ]
Papini, Marta [1 ]
Pornputtapong, Natapol [1 ]
Scalcinati, Gionata [1 ]
Fagerberg, Linn [2 ]
Uhlen, Matthias [2 ,3 ]
Nielsen, Jens [1 ,3 ]
机构
[1] Chalmers Univ Technol, Novo Nordisk Fdn Ctr Biosustainabil, Dept Chem & Biol Engn, SE-41296 Gothenburg, Sweden
[2] Royal Inst Technol, Novo Nordisk Fdn Ctr Biosustainabil, Dept Biotechnol, SE-10691 Stockholm, Sweden
[3] Tech Univ Denmark, Novo Nordisk Fdn Ctr Biosustainabil, DK-2970 Horsholm, Denmark
基金
欧洲研究理事会;
关键词
MESSENGER-RNA; GENOME; ALIGNMENT; QUANTIFICATION; METABOLISM; ALGORITHMS; LANDSCAPE; FRAMEWORK; GENOTYPE; MODEL;
D O I
10.1093/nar/gks804
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
RNA-seq, has recently become an attractive method of choice in the studies of transcriptomes, promising several advantages compared with microarrays. In this study, we sought to assess the contribution of the different analytical steps involved in the analysis of RNA-seq data generated with the Illumina platform, and to perform a cross-platform comparison based on the results obtained through Affymetrix microarray. As a case study for our work we, used the Saccharomyces cerevisiae strain CEN.PK 113-7D, grown under two different conditions (batch and chemostat). Here, we asses the influence of genetic variation on the estimation of gene expression level using three different aligners for read-mapping (Gsnap, Stampy and TopHat) on S288c genome, the capabilities of five different statistical methods to detect differential gene expression (baySeq, Cuffdiff, DESeq, edgeR and NOISeq) and we explored the consistency between RNA-seq analysis using reference genome and de novo assembly approach. High reproducibility among biological replicates (correlation >= 0.99) and high consistency between the two platforms for analysis of gene expression levels (correlation >= 0.91) are reported. The results from differential gene expression identification derived from the different statistical methods, as well as their integrated analysis results based on gene ontology annotation are in good agreement. Overall, our study provides a useful and comprehensive comparison between the two platforms (RNA-seq and microrrays) for gene expression analysis and addresses the contribution of the different steps involved in the analysis of RNA-seq data.
引用
收藏
页码:10084 / 10097
页数:14
相关论文
共 58 条
[1]   Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[2]   3′ tag digital gene expression profiling of human brain and universal reference RNA using Illumina Genome Analyzer [J].
Asmann, Yan W. ;
Klee, Eric W. ;
Thompson, E. Aubrey ;
Perez, Edith A. ;
Middha, Sumit ;
Oberg, Ann L. ;
Therneau, Terry M. ;
Smith, David I. ;
Poland, Gregory A. ;
Wieben, Eric D. ;
Kocher, Jean-Pierre A. .
BMC GENOMICS, 2009, 10 :531
[3]   A comparison of massively parallel nucleotide sequencing with oligonucleotide microarrays for global transcription profiling [J].
Bradford, James R. ;
Hey, Yvonne ;
Yates, Tim ;
Li, Yaoyong ;
Pepper, Stuart D. ;
Miller, Crispin J. .
BMC GENOMICS, 2010, 11
[4]   Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments [J].
Bullard, James H. ;
Purdom, Elizabeth ;
Hansen, Kasper D. ;
Dudoit, Sandrine .
BMC BIOINFORMATICS, 2010, 11
[5]   Integrated multilaboratory systems biology reveals differences in protein metabolism between two reference yeast strains [J].
Canelas, Andre B. ;
Harrison, Nicola ;
Fazio, Alessandro ;
Zhang, Jie ;
Pitkanen, Juha-Pekka ;
van den Brink, Joost ;
Bakker, Barbara M. ;
Bogner, Lara ;
Bouwman, Jildau ;
Castrillo, Juan I. ;
Cankorur, Ayca ;
Chumnanpuen, Pramote ;
Daran-Lapujade, Pascale ;
Dikicioglu, Duygu ;
van Eunen, Karen ;
Ewald, Jennifer C. ;
Heijnen, Joseph J. ;
Kirdar, Betul ;
Mattila, Ismo ;
Mensonides, Femke I. C. ;
Niebel, Anja ;
Penttila, Merja ;
Pronk, Jack T. ;
Reuss, Matthias ;
Salusjarvi, Laura ;
Sauer, Uwe ;
Sherman, David ;
Siemann-Herzberg, Martin ;
Westerhoff, Hans ;
de Winde, Johannes ;
Petranovic, Dina ;
Oliver, Stephen G. ;
Workman, Christopher T. ;
Zamboni, Nicola ;
Nielsen, Jens .
NATURE COMMUNICATIONS, 2010, 1
[6]   Stem cell transcriptome profiling via massive-scale mRNA sequencing [J].
Cloonan, Nicole ;
Forrest, Alistair R. R. ;
Kolle, Gabriel ;
Gardiner, Brooke B. A. ;
Faulkner, Geoffrey J. ;
Brown, Mellissa K. ;
Taylor, Darrin F. ;
Steptoe, Anita L. ;
Wani, Shivangi ;
Bethel, Graeme ;
Robertson, Alan J. ;
Perkins, Andrew C. ;
Bruce, Stephen J. ;
Lee, Clarence C. ;
Ranade, Swati S. ;
Peckham, Heather E. ;
Manning, Jonathan M. ;
McKernan, Kevin J. ;
Grimmond, Sean M. .
NATURE METHODS, 2008, 5 (07) :613-619
[7]   SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data [J].
Cox, Murray P. ;
Peterson, Daniel A. ;
Biggs, Patrick J. .
BMC BIOINFORMATICS, 2010, 11
[8]   Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data [J].
Degner, Jacob F. ;
Marioni, John C. ;
Pai, Athma A. ;
Pickrell, Joseph K. ;
Nkadori, Everlyne ;
Gilad, Yoav ;
Pritchard, Jonathan K. .
BIOINFORMATICS, 2009, 25 (24) :3207-3212
[9]   Compatibility with Killer Explains the Rise of RNAi-Deficient Fungi [J].
Drinnenberg, Ines A. ;
Fink, Gerald R. ;
Bartel, David P. .
SCIENCE, 2011, 333 (6049) :1592-1592
[10]   A second generation human haplotype map of over 3.1 million SNPs [J].
Frazer, Kelly A. ;
Ballinger, Dennis G. ;
Cox, David R. ;
Hinds, David A. ;
Stuve, Laura L. ;
Gibbs, Richard A. ;
Belmont, John W. ;
Boudreau, Andrew ;
Hardenbol, Paul ;
Leal, Suzanne M. ;
Pasternak, Shiran ;
Wheeler, David A. ;
Willis, Thomas D. ;
Yu, Fuli ;
Yang, Huanming ;
Zeng, Changqing ;
Gao, Yang ;
Hu, Haoran ;
Hu, Weitao ;
Li, Chaohua ;
Lin, Wei ;
Liu, Siqi ;
Pan, Hao ;
Tang, Xiaoli ;
Wang, Jian ;
Wang, Wei ;
Yu, Jun ;
Zhang, Bo ;
Zhang, Qingrun ;
Zhao, Hongbin ;
Zhao, Hui ;
Zhou, Jun ;
Gabriel, Stacey B. ;
Barry, Rachel ;
Blumenstiel, Brendan ;
Camargo, Amy ;
Defelice, Matthew ;
Faggart, Maura ;
Goyette, Mary ;
Gupta, Supriya ;
Moore, Jamie ;
Nguyen, Huy ;
Onofrio, Robert C. ;
Parkin, Melissa ;
Roy, Jessica ;
Stahl, Erich ;
Winchester, Ellen ;
Ziaugra, Liuda ;
Altshuler, David ;
Shen, Yan .
NATURE, 2007, 449 (7164) :851-U3